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Process for the production of fine chemicals 

[0001 .0.0.0] The present invention relates to a process for the production of the fine 
chemical in a microorgansm, a plant cell, a plant, a plant tissue or in one or more parts 
thereof. The invention furthermore relates to nucleic acid molecules, polypeptides, 
5 nuC !sic acid constructs, vectors., antisense molecules, antibodies, host cells, plant 
tissue, propagtion material, harvested material, plants, microorganisms as well as 
agricultural compositions and to their use. 

[0002.0.0.0] Certain products and by-products of naturally-occurring metabolic 
processes in cells have utility in a wide array of industries, including, but not limited to, 
10 the food, feed, cosmetics, and pharmaceutical industries and agriculture. These 

molecules, collectively termed fine chemicals' include molecules such as vitamins for 
example vitamin A, D. E, K, B 1f B 2 , B 6 . B 12 . C. pantothenic acid, biotin orfolic acid; 
substances with vitamin-like character for example vitamin F. lipoic acid, ubiquinones, 
choline, myoinsositiol, vitamin U (S-methylmethionine), flavours for example vanillin, 
1 5 coumarin, isoeugenol. eugenol, (R)-carvone, (S)-carvone, menthol, jasmone or 
famesol; nutraceuticals for example phytosterols, flavonoids, anthocyanidins, 
isoflavons or isoprenoids; detergents; fatty acids such as saturated fatty acids, mono 
unsaturated fatty acids (singular MUFA, plural MUFAS), poly unsaturated fatty acids 
(singular PUFA, plural PUFAS), waxes or lipids containing said fatty acids; 
20 carbohydrates for example cellulose, starch, dextrin, pectin, xanthangum, carrageenan 
or alginate; sugars for example monosaccharides such as glucose, fructose, manose, 
sorbose, ribose, ribulose, xylose, xylulose or galactose, disaccharides such as lactose, 
sucrose, saccharose, maltose, isomaltose or cellobiose, trisaccharides such as 
raffinose or maltotriose; carboxylic acids for example citric acid, a-ketoglutaric acid, 
25 ferulic acid, sinapic acid or lactic acid; carotinoids for example cc-carotene, S- carotene, 
zeaxanthine, lutein, astaxanthine, lycopene, phyotoene or phytofluene, amino acids for 
example lysine, threonine, methionine, tryptophane, phenylalanine or tyrosine, 
cofactors for example heme or quinines, enzymes for example lipases, esterases, 
proteases, amylases, glucosidases etc. and other compounds [as described e.g. in 
30 Kuninaka, A. (1996) Nucleotides and related compounds, p. 561-612, in Biotechnology 
vol. 6. Rehm et al., eds. VCH: Weinheim, in Industial Microbiology and Biotechnology. 
Demain et al., second edition, ASM Press Washington, D.C. 1999, in Ullmann's 
Encyclopedia of Industrial Chemistry, vol. A27, Vitamins, p. 443-613 (1996) VCH: 
Weinheim and Ong, AS., Niki, E. & Packer, L (1995) Nutrition, Lipids, Health, and 
35 Disease Proceedings of the UNESCO/Confederation of Scientific and Technological 
Associations in Malaysia, and the Society for Free Radical Research, Asia, held Sept. 
1-3, 1994 at Penang, Malaysia, AOCS Press, (1995)), enzymes, and all other 
chemicals described in Gutcho (1983) Chemicals by Fermentation, Noyes Data 
Corporation, ISBN: 0818805086 and references contained therein]. Carotinoids are 
40 added for example to soft drinks, margarines or to animal feed for example to colour 
egg yolk or the flesh offish. In the food industry polycarbohydrates are widely used as 
thickener. Polyunsaturated fatty acids are added for example to infant formulas to 
create a higher nutrition value of such formulas. PUFAs have for example a positive 
influence on the cholesterol level of the blood in humans and therefore are useful in the 
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protection of heart diseases. Fine chemicals for example PUFAS can be isolated from 
animal sources such as for example fish or produced with microorganisms through the 
large-scale culture of microorganisms developed to produce and accumulate or secrete 
large quantities of one or more desired molecules. 

[0003.0.0.0] In large scale fine chemicals are produced with microorganism in the 
fermentation industry, which is responsible for the manufacturing of at least five major 
ingredient categories: antibiotics, organic acids, amino acids, enzymes, vitamins and 
other related products. There are production facilities in all important areas of the world 
especially in Europe, the US and Asia. Companies continuously try to optimize the 
production processes, the organisms and thereby increasing the efficiency but, as in 
the case of amino acids and organic acids, with already high conversion rates based- . 
on feeded carbon source, the limitations of such work become evident. All fermentation 
processes depend on the efficient utilization of carbohydrates, supplied mainly in the 
form of oils, glucose or molasses. It is therefore the availability and pricing of these raw 
materials that influence the competitiveness of fermentation products versus production 
for example in plants. Amino acids, organic acids and vitamins are offered at very low 
prices. For such products the question is whether it is still economical to continue 
fermentation production in future. And that is frequently a question of comparing the 
availability and pricing of carbohydrates with the future markets. 

[0004.0.0.0] Particularly useful organisms for the production of fine chemicals are 
microorganisms such as the algae, fungi, bacteria or plants. Through strain selection, a 
number of mutant strains of microorganisms have been developed which produce an 
array of desirable compounds including vitamins, amino acids, PUFAs etc. However, 
selection of strains improved for the production of a particular molecule is a time- 
consuming and difficult process. 

[0005.0.0.0] Alternatively the production of fine chemicals can be most conveniently 
performed via the large scale production of plants developed to produce for example 
carotinoids, carbohydrates or PUFAS. For example for the production of carotinoids 
plants such as marigold are used. Particularly well-suited plants for this purpose are 
sugar producing plants such as sugar beet or sugar cane or oilseed plants containing 
high amounts of lipid compounds such as rapeseed, canola, linseed, soybean, 
sunflower, borage and evening primrose. But also other crop plants containing sugars, 
oils or lipids and fatty acids are well suited as mentioned in the detailed description of 
this invention. Through conventional breeding, a number of mutant plants have been 
developed which produce an array of desirable lipids and fatty acids, cofactors and 
enzymes. However, selection of new plant cultivars improved for the production of a 
particular molecule is a time-consuming and difficult process or even impossible if the 
compound does not naturally occur in the respective plant as for example in the case of 
C 20 and higher C-carbon chain polyunsaturated fatty acids. 

[0006.0.0.0] Carbohydrates are an important dietary nutrient, which is mostly used to 
supply energy to the body, as" well as, a carbon source for synthesis of other 
compounds such as fats or proteins. Furthermore mono- and disaccharides are widely 
used in the food and feed industry as sweetener. Saccharides have varying degrees of 
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sweetness on a relative scale. Fructose is the sweetest. For example in the United 
States 22 million tons of sugar and other sweeteners were consumed in 1999. Said 
natural sweetener consumption includes the consumption of sugar, com sweeteners 
such as high fructose corn syrup as main, product and others such as honey or maple 
5 syrup. All natural sugar based sweeteners have a market shar of around 36 to 40 
percent. 

[0007.0.0.0] Whether unsaturated or saturated fatty acids are preferred in the food 
and feed industry depends on the intended purpose; thus, for example, lipids with 
unsaturated fatty acids, specifically polyunsaturated fatty acids, are preferred in human 

10 nutrition since they have a positive effect on the cholesterol level in the blood and thus 
. on.the possibility of heart disease. They are used in a variety of dietetic foodstuffs or 
medicaments. In addition PUFAs are commonly used in food, feed and in the cosmetic 
industry. Poly unsaturated co-3- and/or w-6-fatty acids are an important part of animal 
feed and human food. Because of the common composition of human food poly 

1 5 unsaturated oo-3-fatty acids, which are an essential component of fish oil, should be 
added to the food to increase the nutritional value of the food; thus, for example, poly 
unsaturated fatty acids such as docosahexaenoic acid or eicosapentaenoic acid are 
added as mentioned above to infant formula to increase its nutritional value. 

[0008.0.0.0] Vitamins such as vitamin C, vitamin B1 2 or vitamin B2 are typically 
20 produced with microorganism as mentioned above in the fermentation industry. Vitamin 
C can be produced generally in a combined process using biotransformation steps in 
combination with classical chemical synthesis. In another production process vitamin C 
is produced by fermentation alone. In general organisms such as Arthrobacter, 
Gluconobacter, Corynebacterium, Brevibacterium or Erwinia are used for vitamin C 
25 production. Vitamin B2 and vitamin B12 are produced with organisms such as Bacillus, 
Streptomyces, Citrobacter, Klebsiella, Propionibacterium or Ashbya in large scale 
fermentation. 

[0009.0.0.0] Commonly vitamin E and A are procuded in a classical chemical 
process or isolated from as natural vitamin E from plant oils. Vitamin E is an important 

30 natural fat-soluble antioxidants. As such, vitamin E protects cell membranes from the 
damage caused by free radicals. High doses of vitamin E have also been linked to a 
decreased ability of the blood to clot, which may be beneficial in those individuals at 
risk for heart disease by reducing the risk of heart attack. A vitamin E deficiency leads 
to pathophysiological situations in humans and animals. Of the different types of 

35 vitamin E, the alpha tocopherol form is typically considered the "gold standard" in terms 
of antioxidant activity - although the most recent research suggests that the other 
chemical forms may possess equivalent or superior antioxidant protection. Vitamin E 
compounds therefore are of high economical value as additives in the food and feed 
sectors, in pharmaceutical formulations and in cosmetic applications. Vitamin A is 

40 another fat-soluble vitamin that is part of a family of compounds including retinol. retinal 
and beta-carotene. Beta-carotene is also known as pro-vitamin A because it can be 
converted into vitamin A when additional levels are required. Vitamin A is needed by all 
of the body's tissues for general growth and repair processes and is especially 
important for bone formation, healthy skin/hair, night vision and function of the immune 
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system. Vitamin A may help boost immune system function and resistance to infection. 
Vitamin A derivatives are widely used in cosmetics and dermatological treatments for 
skin preparations designed to combat skin aging and treat acne. Vitamin A has been 
used for decades as a treatment for various vision-related conditions, including night 
5 blindness, cataracts, conjunctivitis, retinopathy and macular degeneration. 

[0010.0.0.0] An economical method for producing of vitamins such as vitamin C, B2, 
B12 or vitamin E and food- and feedstuffs with increased vitamin content are therefore 
very important Particularly economical methods are biotechnological methods utilizing 
vitamin-producing organisms, which are either natural or optimized by genetic 
10 modification. 

[001 1.0.0.0] Carotenoids are a large family of compounds including over 600 
members such as p-carotene, lycopene or lutein. Carotenoids are widely distributed in 
fruits and vegetables and are responsible, along with flavonoids, for contributing the 
color to many plants (a rule of thumb is the brighter, the better). In terms of nutrition, p- 
15 carotene's primary role is as mentioned above a precursor to vitamin A. p-carotene as 
most other carotenoids, is a powerful antioxidant - so it has been recommended to 
protect against a variety of diseases such as cancer, cataracts and heart disease. 

[001 2.0.0.0] The introduction of a new gene or new genes for the synthesis of fine 
chemicals into an organism or cell may not just increase the biosynthetic flux towards 

20 an end product it may also increase or create de novo a new compound composition. 
Similarly, other genes involved in the import of nutrients necessary for the biosynthesis 
of one or more fine chemicals (e.g., fatty acids, polar and neutral lipids, vitamins, 
enzymes etc.) may be increased in number or activity such that these precursors, 
cofactors, or intermediate compounds are increased in concentration within the cell or 

25 within the storing compartment thus increasing further the capability of the cell to 
produce the fine chemical as described herein. 

[0013.0.0.0] Amino acids are used in many branches of industry, including the food, 
animal feed, cosmetics, pharmaceutical and chemical industries. Amino acids such as 
D,L-methionine, L-lysine or L-threonine are used in the Animal feed industry. The 

30 essential amino acids valine, leucine, isoieucine, lysine, threonine, methionine, 

tyrosine, phenylalanine and tryptophan are particularly important for the nutrition of 
mammals especially humans and a number of livestock species. Glycine, L-methionine 
and tryptophan are all used in the pharmaceutical industry. Glutamine, valine, leucine, 
isoieucine, histidine, arginine, proline, serine and alanine are used in the 

35 pharmaceutical and cosmetics industries. Threonine, tryptophan and d, L-methionine 
are widely used feed additives (Leuchtenberger, W. (1996) Amino acids - technical 
production and use, pp. 466-502 in Rehm et al., (Ed.) Biotechnology vol. 6, chapter 
14a, VCH Weinheim). Moreover, amino acids are suitable for the chemical industry as 
precursors for the synthesis of synthetic amino acids and proteins, such as 

40 N-acetylcysteine, S-carboxymethyl-L-cysteine, (S)-5-hydroxytryptophan and other 
subtances described in Ullmann's Encyclopedia of Industrial Chemistry, vol. A2, pp. 
57-97, VCH Weinheim, 1985. To prefent physiological malnutritions the human body 
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has a need for essentiall amino acids such as arginine, histidine, isoleucine, leucine, 
lysine, methionine, tyrosine, phenylalanine, threonine, tryptophan, and valine. Based 
on their content of amino acids, foods are often classified as complete, partially 
complete, or incomplete protein sources. In order for a protein to be complete, it must 
5 contain all of the essential amino acids. This is the reason that many nutritionists rank 
non-rneat foods as being incomplete. The foods do contain all amino acids, but some 
may be in lower proportions than are required, and, therefore, should be combined with 
another food containing higher amounts of these amino acids or should be 
supplemented with said essential amino acids. 

10 [0014.0.0.0] Over one million tonnes of amino acids are currently produced annually; 
their market value amounts to over 2.5 billion US dollars. They are currently produced 
by four competing processes: Extraction from protein hydrolysates, for example L- 
cystine, L-leucine or L-tyrosine, chemical synthesis, for example of D,L-methionine, 
conversion of chemical precursors in an enzyme or cell reactor, for example L- 

1 5 phenylalanine, and fermentative production by growing, on an industrial scale, bacteria 
which have been developed to produce and secrete large amounts of the desired 
molecule in question. An organism, which is particularly suitable for this purpose is 
Corynebacterium glutamicum. which is used for example for the production of L-lysine 
or L-glutamic acid. Other amino acids which are produced by fermentation are, for 

20 example, L-threonine, L-tryptophan, L-aspartic acid and L-phenylalanine. 

[001 5.0.0.0] The biosynthesis of the natural amino acids in organisms capable of 
producing them, for example bacteria, has been characterised thoroughly; for a review 
of the bacterial amino acid biosynthesis and its regulation, see Umbarger, H.E. (1978) 
Ann. Rev. Biochem. 47: 533 - 606]. 

25 [001 6.0.0.0] It is known that amino acids are produced by fermentation of strains of 
coryneform bacteria, in particular Corynebacterium glutamicum. Due to their great 
importance, the production processes are constantly being improved. Process 
improvements can relate to measures regarding technical aspects of the fermentation, 
such as. for example, stirring and oxygen supply, or the nutrient media composition, 
30 such as, for example, the sugar concentration during fermentation, or to the work-up to 
give the product, for example by ion exchange chromatography, or to the intrinsic 
performance properties of the microorganism itself. Bacteria from other genera such as 
Escherichia or Bacillus are also used for the production of amino acids. A number of 
mutant strains, which produce an assortment of desirable compounds from the group 

35 of the sulfur-containing fine chemicals have been developed via strain selection. The 
performance properties of said microorganisms are improved with respect tothe 
production of a particular molecule by applying methods of mutagenesis, selection and 
mutant selection. Methods for the production of methionine have also been developed. 
In this manner, strains are obtained which are, for example, resistant to 

40 antimetabolites, such as. for example, the methionine analogues ct-methylmethionine, 
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ethionine, norleucine, N-acetylnorleucine, S-trifluoromethylhomocysteine, 2-amino-5- 
heprenoitic acid, selenomethionine, methionine sutfoximine, methoxine, 1- 
aminocyclopentanecarboxylic acid or which are auxotrophic for metabolites with 
regulatory importance and which produce sulfur-containing fine chemicals such as, for 
5 example, L-methionine. However, such processes developed for the production of 
methionine have the disadvantage that their yields are too low for being economically 
exploitable and that they are therefore not yet competitive with regard to chemical 
synthesis. 

[0017.0.0.0] Zeh (Plant Physiol., Vol. 127, 2001: 792-802) describes increasing the 
10 . roethionine_content in potato plants by inhibiting threonine synthase by what is known 
as antisense technology. This leads to a reduced threonine synthase activity without 
the threonine content in the plant being reduced. This technology is highly complex; the 
enzymatic activity must be inhibited in a very differentiated manner since otherwise 
auxotrophism for the amino acid occurs and the plant will no longer grow. 

15 [001 8.0.0.0] US 5,589,61 6 teaches the production of higher amounts of amino acids 
in plants by overexpressing a monocot storage protein in dicots. WO 96/38574, WO 
97/07665, WO 97/28247, US 4,886,878, US 5,082*993 and US 5,670,635 are following 
this approach. That means in all the aforementioned intellectual property rights different 
proteins or polypeptides are expressed in plants. Said proteins or polypeptides should 

20 function as amino acid sinks. Other methods for increasing amino acids such as lysine 
are disclosed in WO 95/15392, WO 96/38574, WO 89/11789 or WO 93/19190. In this 
cases speziell enzymes in the amino acid biosynthetic pathway such as the 
diphydrodipicolinic acid synthase are deregulated. This leads to an increase in the 
production of lysine in the different plants. Another approach to increase the level of 

25 amino acids in plants is disclosed in EP-A-0 271 408. EP-A-0 271 408 teaches the 
mutagenensis of plant and selection afterwards with inhibitors of certain enzymes of 
amino acid biosynthetic pathway. 

[0019.0.0.0] Methods of recombinant DNA technology have also been used for some 
years to improve Corynebacterium strains producing L-amino acids by amplifying 
30 individual amino acid biosynthesis genes and investigating the effect on amino acid 
production. 

[0020.0.0.0] As described above, the essential amino acids are necessary for 
humans and many mammals, for example for livestock. L-methionine is important as 
methyl group donor for the biosynthesis of, for example, choline, creatine, adrenaline, 
35 bases and RNA and DNA, histidine, and for the transmethylation following the 

formation of S-adenosylmethionine or as a sulfhydryl group donor for the formation of 
cysteine. Moreover, L-methionine appears to have a positive effect in depression. 
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[0021 .0.0.0] Improving the quality of foodstuffs and animal feeds is an important task 
of the food-and-feed industry. This is necessary since, for example, certain amino 
acids, which occur in plants are limited with regard to the supply of mammals. 
Especially advantageous for the quality of foodstuffs and animal feeds is as balanced 
5 as possible an amino acid profile since a great excess of an amino acid above a 

j. c I ~~ «,ii4har nocitiwa offoot «n th« Utilisation Of the 

Specmc concenirauoi i in uie iuuu uao nw miuia K ^.«.w ~. 

food since other amino acids suddenly become limiting. A further increase in quality is 
only possible via addition of further amino acids, which are limiting under these 
conditions. The targeted addition of the limiting amino acid in the form of synthetic 

1 0 products must be carried out with extreme caution in order to avoid amino acid 

imbalance. For example, the addition of an essential amino acid stimulates protein 

digestion, which may cause deficiency situations for the second or third limiting amino 
acid, in particular. In feeding experiments, for example casein feeding experiments, the 
additional provision of methionine, which is limiting in casein, has revealed the fatty 

1 5 degeneration of liver, which could only be alleviated after the additional provision of 
tryptophan. 

[0022.0.0.0] To ensure a high quality of foods and animal feeds, it is therefore 
necessary to add fine chemicals that means a plurality of compounds such as amino 
acids, vitamins, organic acids, PUFAS etc. in a balanced manner to suit the organism. 

20 Such supplemented food is named as "functional foods" or "nutraceuticals". 
Nutraceuticals shall provide a health benefit to humans beyond basic nutrition. 
Functional foods have health-promoting or disease-preventing effects. Examples 
include omega-3 fatty acids (found in many fish, flaxseed oil, soybean oil, canola oil, 
and walnuts), which reduce risk of coronary heart disease and lycopene in tomatoes. 

25 which has been associated with reduced risk of certain cancers. 

[0023.0:0.0] From a practical standpoint it would be of great advantage to produce 
an organism such as a microorganism or a plant containing a combination of different 
fine chemicals such as amino acids, vitamins, organic acids, carotenoids, PUFAS etc. 
at the same time in an sufficient amount to provide optimal growth and health benefit to 
30 animals or humans instead of combining different food or supplementing food or feed 
with different fine chemicals. 

[0024.0.0.0] It is therefore an object of the present invention to develop an 
inexpensive process for the synthesis of a combination of fine chemicals such as 
amino acids like tryptophane, proline, arginine, phenylalanine, tyrosine, alanine, 

35 glycine, threonine, serine, valine, isoleucine or leucine especially essential amino acids 
such as tryptophane, arginine, phenylalanine, tyrosine, threonine, valine, isoleucine or 
leucine, carbohydrates such as raffinose, maltose or inositol, vitamins such as y-, a- or 
p-tocopherol, organic acids such as ferulic acid, malate or sinapic acid, carotenoids 
such as p-carotene etc. at the same time in an sufficient amount to provide optimal 

40 growth and health benefit to animals or humans 
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[0025.0.0.0] It was now found that this object is achieved by providing the process 
according to the invention described herein and the embodiments characterized in the 
claims. 

[0026.0.0.0] Accordingly, in a first embodiment, the invention relates to a process for 
5 the production of a fine chemical, whereby the fine chemical is at least one compound 
selected from the group consisting of tryptophane, proline, arginine, phenylalanine, 
tyrosine, alanine, glycine, threonine, serine, valine, isoleucine, leucine, raffinose, 
maltose, inositol, ferulic acid, malate, y- tocopherol, a- tocopherol, p-tocopherol, cerotic 
acid, lignoceric acid, putrescine, sinapic acid, 3,4-dihydroxyphenylaline (= DOPA), 

10 stearic acid and p-carotene.. Accordingly, in the. present invention, the.term "the fine . . 
chemical" as used herein relates to an amino acid selected from the group consisting of 
tryptophane, proline, arginine, phenylalanine, tyrosine, alanine, glycine, threonine, 
serine, valine, isoleucine and leucine; a vitamin selected from the group consisting of y- 
tocopherol, a- tocopherol and p-tocopherol, a fatty acid selected from the group 

15 consisting of cerotic acid, lignoceric acid and stearic acid, . an organic acid selected 
from the group consisting of ferulic acid, malate and sinapic acid or a compound 
selected from the group consisting of raffinose, putrescine, 3,4-dihydroxyphenylaline (= 
DOPA) and p-carotene or mixtures thereof containing at least two, three, four or five 
compounds selected from the aforementioned groups, preferably 6, 7, "8 or 9 

20 compounds selected from the aforementioned groups, most preferably 10, 11, 12, 13, 
14, 15, 16, 17 or more compounds selected from the aforementioned groups. Further, 
the term "the fine chemicals" as used herein also relates to fine chemicals comprising 
at least one compound selected from the group consisting of tryptophane, proline, 
arginine, phenylalanine, tyrosine, alanine, glycine, threonine, serine, valine, isoleucine, 

25 leucine, raffinose, maltose, inositol, ferulic acid, malate, y- tocopherol, a- tocopherol, p- 
tocopherol, cerotic acid, lignoceric acid, putrescine, sinapic acid, 3,4- 
dihydroxyphenylaline (= DOPA), stearic acid and p-carotene. 

[0027.0.0.0] In one embodiment, the term "the fine chemical" or "fine chemical" 
means at least one compound selected from the group consisting of L-tryptophane, 

30 L-proline, L-arginine, L-phenylalanine, L-tyrosine, L-alanine, glycine, L-threonine, 
L-serine, L-valine, L-isoleucine, L-leucine, raffinose, maltose, inositol, ferulic acid, 
malate, y-tocopherol, a- tocopherol, p-tocopherol, cerotic acid, lignoceric acid, 
putrescine, sinapic acid, 3,4-dihydroxyphenylaline (= DOPA), stearic acid and 
P-carotene. Throughout the specification the term M the fine chemical" means the 

35 aforementioned compounds, its salts, ester or amids in free form or bound to other 

chemical compounds especially proteins. In a preferred embodiment, the term "the fine 
chemical" or "fine chemical" means at least one compound selected from the group 
consisting of L-tryptophane, L-proline, L-arginine, L-phenylalanine, L-tyrosine, 
L-alanine, glycine, L-threonine, L-serine, L-valine, L-isoleucine and L-leucine in free 

40 form or its salts or bound to proteins. 
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[0028.0.0.0] Accordingly, the present invention relates to a process for the production 
of fine chemical comprising 

(a) increasing or generating the biological activity represented by a protein as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 

5 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76. 78, 80, 82, 

84] 86*, 88! 90, 92, 94, 96, 98, 1 00, 1 02, 1 04, 1 06, 1 08, 1 1 0, 1 1 2, 1 14, 1 1 6, 1 1 8, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 162, 164, 166. 168, 170, 172, 174, 176, 178, 180, 182, 
184, 186, 188. 190, 192. 194, 196, 198, 200. 202, 204, 206, 208, 210, 212. 214. 

10 216. 218. 220. 222, 224. 226, 228, 230, 232, 234. 236, 238, 240, 242, 244, 246, 

248. 250, 252, 254, 256, 258, 260, 262, 264, 266. 268. 270. 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 
312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 
344, 346, 348, 350, 352, 354, 356, 358. 360. 362, 364. 366, 368. 370. 372, 374, 

15 376, 378, 380, 382, 384, 386, 388. 390, 392 or 394 in a non-human organism, or 

in one or more parts thereof; and 

(b) growing the organism under conditions which permit the production of the fine 
chemical in said organism. 

[0029.0.0.0] Comprises/comprising and grammatical variations thereof when used in 
20 this specification are to be taken to specify the presence of stated features, integers, 
steps or components or groups thereof, but not to preclude the presence or addition of 
one or more other features, integers, steps, components or groups thereof. 

[0030.0.0.0] Preferably, this process further comprises the step of recovering the fine 
chemical, which is synthesized by the organism from the organism and/or from the 

25 culture medium used for the growth or maintenance of the organism. The term 

"recovering" means the isolation of the fine chemical in different purities, that means on 
the one hand harvesting of the biological material, which contains the fine chemical 
without further purification and on the other hand purities of the fine chemical between 
5% and 100% purity, preferred purities are in the range of 10% and 99%. In one 

30 embodiment, the purities are 20%. 30%, 40%. 50%, 60%. 70%, 80%, 90%. 95% or 
99% or more. 

[0031 .0.0.0] Advantageously the process for the production of the fine chemical 
leads to an enhanced production of the fine chemical. The terms "enhanced" or 
"increase" mean at least a 10%. 20%, 30%, 40% or 50%, preferably at least 60%, 70%, 
35 80%, 90% or 1 00%. more preferably 1 50%, 200%, 300%. 400% or 500% higher 

production of the fine chemical in comparison to the reference as defined below, e.g. 
that means in comparison to an organism without the aforementioned modification of 
the activity of a protein having the biological activity represented by a protein as 
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depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
5 162, 164, 166, 168, 170. 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
10 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394, 

[0032.0.0.0] Surprisingly it was found, that the transgenic expression of the 
Kjuyveromyces lactis, Cryptococcus neoformans, Neurospora crassa, Penicillium 
mameffei, Mucor rouxii, Schizophyllum commune, Paracoccidioides brasiliensis, 

15 Aspergillus fumigatus, Suiilus bovinus, Candida albicans., Trichoderma reesei, Ashbya 
gossypii, Yarrowia lipolytica, Ustilago maydis, Emericella nidulans, Trichomonas 
vaginalis, Colletotrichum trifolii, Biumeria graminis, Dictyostelium discoideum, 
Saccaromyces cerevisiae, Schizosaccharomyces pombe, Entamoeba histolytica, 
Oryza sativa, Brassica napus, Glycine max, Beta vulgaris, Lotus japonicus, Zinnia 

20 elegans, Zea mays, Cicer arietinum, Arabidopsis thaliana, Hordeum vulgare, Nicotiana 
tabacum, Gossypium hirsutum, Physcomitrella patens, Fucus distichus, Medicago 
truncutula, Homo sapiens, Caenorhabditits elegans, Tigriopus japonicus, 
Rhopalosiphum padi, Mus musculus, Discopyge ommata, Canis lupus, Drosophila 
melanogaster, Anopheles gambiae, Aplysia California, Ciona savignyi, Ciona 

25 intestinalis, Hemicentrotus pulcherrimus, Giardia lamblia, Gallus gallus, Brachydanio 
rerio, Xenopus laevis, Xenopus tropicalis, Schistosoma japonicum, Schistosoma 
mansoni, Encephalitozoon cUniculi, Wuchereria bancrofti, Cavia porcellus, Sus scrofa, 
Rattus norvegicus, Pneumocystis carinii or Pagrus major proteins as depicted in SEQ 
ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 

30 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 
100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126. 128, 130, 132, 
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 
168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 
202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 

35 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 
270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 
304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 
338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 
372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 for example in 

40 Arabidopsis thaliana conferred an increase in the fine chemical content of the 
transformed plants. 
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[0033.0.0.0] In accordance with the invention, the term "organism" as understood 
herein relates always to a non-human organism, in particular to an animal or plant 
organism or to a microorganism. Further, the term "animal" as understood herein 
relates always to a non-human animal. 

5 [0034.0.0.0] The sequence depicted in SEQ ID NO: 1 (YNL090W) from 
Saccharomyces cerevisiae has been published in Madaule et ai. [1 98/ , 
"Characterization of two members of the rho gene family from the yeast 
Saccharomyces cerevisiae"; Proc. Natl. Acad. Sci., USA 84(3):779-83], and named as 
rho2. The gene encodes as all the other sequences mentioned above and use in the 
10 inventive process a non-essential GTPase of the rho/rac subfamily of the ras-like 
GTPases. The protein may play a role in the establishment of cell polarity or in 
microtubule assembly. Accordingly, in one embodiment, the process of the present 
invention comprises the use of a gene product "involved in in the establishment of cell 
polarity or in microtubule assembly" from Saccaromyces cerevisiae or its homolog, e.g. 

1 5 as shown herein, for the production of the fine chemical, meaning of preferably for the 
production of essential amino acids, in particular for increasing the amount of an 
essential amino acid in free or bound form in an organism or a part thereof, as 
mentioned herein. That means the increase of its biological activity by overexpression 
of the responsible gene leads to an increase of the fine chemical. 

20 [0035.0.0.0] The term "biological activity" means the biological function of the protein 
of the invention. In contrast to the term "biological activity" the term "activity" means the 
increase in the production of the compound produced by the inventive process. The 
term "biological activity" preferably refers to for example the enzymatic function, 
transporter or carrier function, DNA-packaging function, heat shock protein function, 

25 recombination protein function or regulatory function of a peptide or protein in an 

organism, a tissue, a cell or a cell compartment. Suitable substrates are low-molecular- 
weight compounds and also the protein interaction partners of a protein. The term 
"increase" of the biological function refers, for example, to the increase in binding 
capacity or binding strength of a protein for at least one substrate in an organism, a 

30 tissue, a cell or a cell compartment - for example by one of the methods described 
herein below - in comparison with the wild type of the same genus and species to 
which this method has not been applied, under otherwise identical conditions (such as, 
for example, culture conditions, age of the plants and the like). Increase is also 
understood as meaning the modification of the substrate specificity as can be 

35 expressed for example, by the kcat/Km value. In this context, an increase of the 
function of at least 10%, advantageously of at least 20%. preferably at least 30%. 
especially preferably of at least 40%, 50%, 60 %, 70%, 80%, 90% or more, very 
especially preferably of at least 150%, 200%, 250%, 300% or more, in companson with 
the untreated organism is advantageous. 



WO 2005/014828 PCT/EP2004/008136 

12 

[0036.0.0.0] Homologues ( = homologs) of the present gene products can be derived 
from any organisms as long as the liomologue confers the herein mentioned activity, in 
particular, confers an increase in the fine chemical amount or content Further, in the 
present invention, the term "homologue" relates to the sequence of an organism having 
5 the highest sequence homology to the herein mentioned or listed sequences of all 
expressed sequences of said organism. However, the person skilled in the art knows, 
that preferably, the homologue has said the fine chemical increasing activity and, if 
known, the same biological function or activity in the organism as the YNL090W protein 
as depicted in SEQ ID NO: 2. In one embodiment, the homolog of the SEQ ID NO: 2 is 

10 a homolog having said biological acitivity and being derived from Eukaryot such as 
... plants like the families Anacardiaceae, Asteraceae, Apiaceae, Betulaceae, 
Boraginaceae, Brassicaceae, Bromeliaceae, Caricaceae, Cannabaceae, 
Convolvulaceae, Chenopodiaceae, Cucurbitaceae, Elaeagnaceae, Ericaceae, 
Euphorbiaceae, Fabaceae, Geraniaceae, Gramineae, Juglandaceae, Lauraceae, 

15 Leguminosae or Linaceae; algae, fungi or mosses. In another embodiment, the 

homolog of SEQ ID NO: 2 is a homolog having said acitivity and being derived from 
bacteria. In a further embodiment, the homolog of the SEQ ID NO: 2 is a homolog 
having said acitivity and being derived from fungi. I 

[0037.0.0.0] Further homologs of are described herein below. 

20 [0038.0.0.0] In accordance with the invention, a protein or polypeptide has the "the 
biological activity represented by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 
68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 

25 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 
178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 
212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 
246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 

30 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 
382, 384, 386, 388, 390, 392 or 394" if its de novo activity, or its increased expression 
directly or indirectly leads to an increased the fine chemical level in the organism or a 
part thereof, preferably in a cell of said organism and the protein has the above 

35 mentioned activities of a protein as depicted in SEQ ID NO: 2, 4, 6, 6, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 
72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110. 112, 
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 

40 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 
216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 
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250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 
284, 286, 288, 290, 292, 294, 296. 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 
318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 
352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380. 382, 384, 
5 386, 388, 390, 392 or 394. During the specification the activity or preferably the 

i . ■ _ I ! I ~t I- ~ nmtntn r\r l->r»l\/T\OPltiHo (ir S»n |->l iHttiC PtCACi ITlOleCUlS Or 

DlOIUyiUcll auilVliy VJI suui a [Jiuraii w»« f/uij ^w^w— — — . — . . 

sequence encoding such protein or polypeptide is identical or similar if it still has the 
biological or enzymatic activity of a protein as depicted in SEQ ID NO: 2, 4. 6, 8, 10, 
12, 14, 16, 18, 20, 22, 24, 26, 28. 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 
10 66. 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90. 92, 94, 96, 98, 100, 102, 104, 106, 
108, 110, 112,-114. 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138. 140. 
142, 144. 146, 148, 150, 152, 154. 156. 158, 160, 162. 164, 166, 168, 170, 172, 174, 
176', 178. 180, 182. 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 
21o', 212, 214, 216. 218. 220. 222. 224; 226, 228, 230, 232, 234, 236, 238, 240, 242, 
15 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272. 274. 276. 
278, 280, 282, 284, 286, 288, 290 s 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 
312*, 314', 316, 318, 320, 322, 324, 326, 328, 330, 332, 334. 336, 338, 340, 342, 344, 
346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 
380, 382,' 384, 386. 388. 390, 392 or 394, if it has at least 10% of the original biological 
20 or enzymatic activity, preferably 20%, particularly preferably 30%, most particularly 

preferably 40% in comparison to a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30. 32. 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 
68, 70. 72. 74. 76, 78, 80, 82, 84, 86. 88. 90. 92. 94, 96, 98, 100, 102, 104, 106, 108, 
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 
25 144, 146', 148, 150, 152. 154, 156, 158. 160. 162. 164. 166. 168. 170, 172, 174, 176, 
178', 180. 182, 184, 186. 188. 190. 192. 194. 196. 198, 200. 202, 204, 206, 208, 210, 
212*. 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 
246, 248, 250, 252, 254. 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
30 314', 316. 318. 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346. 
348, 350. 352, 354. 356. 358, 360, 362, 364, 366, 368, 370. 372, 374, 376, 378, 380, 
382, 384, 386, 388, 390, 392 or 394 of Kluyveromyces lactis, Cryptococcus 
neoformans, Neurospora crassa, Penicillium marneffei, Mucor rouxii, Schizophyllum 
commune. Paracoccidioides brasiliensis, Aspergillus fumigatus, Suillus bovinus, 
35 Candida albicans, Trichoderma reesei, Ashbya gossypii, Yarrowia lipolytics, Ustilago 
maydis, Emericella nidulans, Trichomonas vaginalis, Colletotrichum trifolii, Blumeria 
graminis, Dictyostelium discoideum, Saccaromyces cerevisiae, Schizosaccharomyces 
pombe, Entamoeba histolytica, Oryza sativa, Brassica napus, Glycine max, Beta 
vulgaris, Lotus japonicus, Zinnia elegans, Zea mays, Cicer arietinum, Arabidopsis 
40 thaliana, Hordeum vulgare. Nicotiana tabacum. Gossypium hirsutum, Physcomitrella 
patens, Fucus distichus, Medicago truncutula, Homo sapiens, Caenorhabditits elegans, 
Tigriopus japonicus, Rhopalosiphum padi, Mus musculus, Discopyge ommata, Canis 
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lupus, Drosophila melanogaster, Anopheles gambiae, Aplysia California, Ciona 
savignyi, Ciona intestinalis, Hemicentrotus pulcherrimus, Giardia lamblia, Gallus gallus, 
Brachydanio rerio, Xenopus laevis, Xenopus tropicalis, Schistosoma japonicum, 
Schistosoma mansoni, Encephalitozoon cuniculi, Wuchereria bancrofti, Cavia 
5 porcellus, Sus scrofa and/or Pagrus major. 

[0039.0.0.0] The terms "increased", "rised", "extended", "enhanced", "improved" or 
"amplified" relate to a corresponding change of a property in an organism, a part of an 
organism such as a tissue, seed, root, leave, flower etc. or in a cell and are 
interchangeable. Preferably, the overall activity in the volume is increased or 

10 enhanced in cases if the increase or enhancement is related to the increase. or 
enhancement of an activity of a gene product, independent whether the amount of 
gene product or the specific activity of the gene product or both is increased or 
enhanced or whether the amount, stability or translation efficacy of the nucleic acid 
sequence or gene encoding for the gene product is increased or enhanced. The terms 

15 "reduction", "decrease" or "deletion" relate to a corresponding change of a property in 
an organism, a part of an organism such as a tissue, seed, root, leave, flower etc. or in 
a cell. Preferrably, the overall activity in the volume is reduced, decreased or deleted in 
cases if the reduction, decrease or deletion is related to the reduction, decrease or 
deletion of an activity of a gene product, independent whether the amount of gene 

20 product or the specific activity of the gene product or both is reduced, decreased or 
deleted or whether the amount, stability or translation efficacy of the nucleic acid 
sequence or gene encoding for the gene product is reduced, decreased or deleted. 

[0040.0.0.0] The terms "increase" or "decrease" relate to a corresponding change of 
a property an organism or in a part of an organism, such as a tissue, seed, root, leave, 
25 flower etc. or in a cell. Preferrably, the overall activity in the volume is increased in 

cases the increase relates to the increase of an activity of a gene product, independent 
whether the amount of gene product or the specific activity of the gene product or both 
is increased or generated or whether the amount, stability or translation efficacy of the 
nucleic acid sequence or gene encoding for the gene product is increased. 

30 [0041,0.0.0] Under "change of a property" it is understood that the activity, 

expression level or amount of a gene product or the metabolite content is changed in a 
specific volume relative to a corresponding volume of a control, reference or wild type, 
including the de novo creation of the activity or expression. 

[0042.0.0.0] The terms "increase" or "decrease" include the change of said property 
35 in only parts of the subject of the present invention, for example, the modification can 
be found in compartment of a cell, like a organelle, or in a part of a plant, like tissue, 
seed, root, leave, flower etc. but is not detectable if the overall subject, i.e. complete 
cell or plant, is tested. Preferably, the increase or decrease is found cellular, thus the 
term "increase of an acitivity" or "increase of a metabolite content" relates to the cellular 
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increase compared to the wild typ cell. Typically said increase is based on the higher 
content of the transcribed nucleic acid or the protein translated from said nucleic acid 
sequence in the modified cell in comparison to the wild type cell. 

[0043.0.0.0] Accordingly, the term "increase" or "decrease" means that the specific 
5 activity of an enzyme as well as the amount of a compound or metabolite, e.g. of a 
polypeptide, a nucleic acid molelcule or of the fine chemical of the invention or an 
encoding mRNA or DNA, can be increased or decreased in a volume. 

[0044.0.0.0] The terms "wild type", "control" or "reference" are exchangeable and can 
be a cell or a part of organisms such as an organelle or a tissue, or an organism, in 

10 particular a microorganism or a plant, which was not modified or treated according to 
the herein described process according to the invention. Accordingly, the cell or a part 
of organisms such as an organelle or a tissue, or an organism, in particular a 
microorganism or a plant used as wild typ, control or reference corresponds to the cell, 
organism or part thereof as much as possible and is in any other property but in the 

15 result of the process of the invention as identical to the subject matter of the invention 
as possible. Thus, the wild type, control or reference is treated identically or as identical 
. as possible, saying that only conditions or properties might be different which do not 
influence the quality of the tested property. 

[0045.0.0.0] Preferably, any comparison is carried out under analogous conditions. 
20 The term "analogous conditions" means that all conditions such as, for example, 
culture or growing conditions, assay conditions (such as buffer composition, 
temperature, substrates, pathogen strain, concentrations and the like) are kept identical 
between the experiments to be compared. 

[0046.0.0.0] The "reference", "control", or "wild type" is preferably a subject, e.g. an 
25 organelle, a cell, a tissue, an organism, in particular a plant or a microorganism, which 
was not modified or treated according to the herein described process of the invention 
and is in any other property as similar to the subject matter of the invention as possible. 
The reference, control or wild type is in its genome, transcriptome, proteome or meta- 
bolome as similar as possible to the subject of the present invention. Preferably, the 

30 term "reference-" "control-" or "wild type-"-organelle, -cell, -tissue or -organism, in 
particular plant or microorganism, relates to an organelle, cell, tissue or organism, in 
particular plant or micororganism, which is nearly genetically identical to the organelle, 
cell, tissue or organism, in particular microorganism or plant, of the present invention or 
a part thereof preferably 95%, more peferred are 98%, even more preferred are 

35 99,00%, in particular 99,10%. 99,30%, 99,50%. 99,70%, 99,90%, 99,99%, 99, 999% or 
more.. Most preferable the "reference", "control", or "wild type" is a subject, e.g. an 
organelle, a cell, a tissue, an organism, which is genetically identical to the organism, 
cell or organelle used according to the process of the invention except that the 
responsible or acvitivity conferring nucleic acid molecules or the gene product encoded 
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by them are amended, manipulated, exchanged or introduced according to the 
inventive process. 

[0047.0.0.0] Preferably, the reference, control or wild type differs form the subject of 
the present invention only in the cellular activity of the polypeptide of the invention, e.g. 
5 as result of an increase in the level of the nucleic acid molecule of the present invention 
or an increase of the specific activity of the polypeptide of the invention, e.g. by or in 
the expression level or activity of an protein having the biological activity represented 
by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 

10 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 
224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 

15 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 
292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 
326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 
360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 
394 or its homologs, its biochemical or genetical causes and the increased amount of 

20 the fine chemical. 

[0048.0.0.0] In case, a control, reference or wild type differing from the subject of the 
present invention only by not being subject of the process of the invention can not be 
provided, a control, reference or wild type can be an organism in which the cause for 
the modulation of an activity conferring the increase of the fine chemical or expression 

25 of the nucleic acid molecule of the invention as described herein has been switched 
back or off, e.g. by knocking out the expression of responsible gene product, e.g. by 
antisense inhibition, by inactivation of an activator or agonist, by activation of an 
inhibitor or antagonist, by inhibition through adding inhibitory antibodies, by adding 
active compounds as e.g. hormones, by introducing negative dominant mutants, etc. A 

30 gene production can for example be knocked out by introducing inactivating point 
mutations, which lead to an enzymatic activity inhibition or a destabilization or an 
inhibition of the ability to bind to cofactors etc. 

[0049.0.0.0] Accordingly, preferred reference subject is the starting subject of the 
present process of the invention. Preferably, the reference and the subject matter of 
35 the invention are compared after standardization and normalization, e.g. to the amount 
of total RNA, DNA, or protein or activity or expression of reference genes, like 
housekeeping genes, such as ubiquitin, actin or ribosomal proteins. 
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[0050.0.0.0] A series of mechanisms exists via which a modification of the a protein, 
e.g. the polypeptide of the invention can directly or indirectly affect the yield, production 
and/or production efficiency of the amino acid. 

[0051 .0.0.0] For example, the molecule number or the specific activity of the 
5 polypeptide or the nucleic acid molecule may be increased. Larger amounts of the fine 
chemical can be produced if the polypeptide or the nucleic acid of the invention ss 
expressed de novo in an organism lacking the activity of said protein. However, it is 
also possible to increase the expression of the gene which is naturally present in the 
organisms, for example by modifying the regulation of the gene, or by increasing the 
1 0 stability Of the corresponding mRNA or of the corresponding gene product encoded by 
the nucleic acid molecule of the invention, or by introducing homologous genes from 
other organisms which are differently regulated, eg. not feedback sensitive or not 
feedback regulated. 

[0052.0.0.0] This also applies analogously to the combined increased expression of 
1 5 the nucleic acid molecule of the present invention or its gene product with that of 

further enzymes of the biochemical pathway of the fine chemical e.g. of the amino acid 
biosynthesis pathway, e.g. which are useful for the synthesis of the fine chemicals. 

[0053.0.0.0] The increase, decrease or modulation according to this invention can be 
constitutive, e.g. due to a stable permanent transgenic expression or to a stable 

20 mutation in the corresponding endogenous gene encoding the nucleic acid molecule of 
the invention or to a modulation of the expression or of the behaviour of a gene 
conferring the expression of the polypeptide of the invention, or transient, e.g. due to 
an transient transformation or temporary addition of a modulator such as a agonist or 
antagonist or inducible, e.g. after transformation with a inducible construct carrying the 

25 nucleic acid molecule of the invention under control of a induceable promoter and 
adding the inducer, e.g. tetracycline or as described herein below. 

[0054.0.0.0] The increase in activity of the polypeptide amounts in a cell, a tissue, a 
organelle, an organ or an organism or a part thereof preferably to at least 5%, 
preferably to at least 20% or at to least 50%, especially preferably to at least 70%, 
30 80%, 90% or more, very especially preferably are to at least 200%, 300% or 400%, 
most preferably are to at least 500%. 600% or more in comparison to the control, 
reference or wild type. 

[0055.0.0.0] The specific activity of a polypeptide encoded by a nucleic acid molecule 
of the present invention or of the polypeptide of the present invention can be tested as 
35 described in the examples. In particular, the expression of a protein in question in a 
cell, e.g. a plant cell or a microorganism and the detection of an increase the fine 
chemical level in comparison to a control is an easy test and can be performed as 
described in the state of the art. 
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[0056.0.0.0] The term "increase" includes, that a compound or an activity is 
introduced into a cell de novo or that the compound or the activity has not been 
detectable before, in other words it is "generated". 

[0057.0.0.0] Accordingly, in the following, the term "increasing" also comprises the 
5 term "generating" or "stimulating". The increased activity manifests itself iri an increase 
of the fine chemical. 

[0058.0.0.0] In case the biological activity of the Saccaromyces cerevisiae protein 
YNL090W as depicted in SEQ ID NO: 2, or the biological activity of the of 
Kluyveromyces lactis, Cryptococcus neoformans, Neurospora crassa, Penicillium 

10 marneffei, Mucor rouxii, Schizophyllum commune, Paracoccidioides brasiliensis, 

Aspergillus fumigatus, Suillus bovinus, Candida albicans, Trichoderma reesei, Ashbya 
gossypii, Yarrowia lipolytica, Ustilago maydis, Emericella nidulans, Trichomonas 
vaginalis, Colletotrichum trifolii, Blumeria graminis, Dictyostelium discoideum, 
Schizosaccharomyces pombe, Entamoeba histolytica, Oryza sativa, Brassica napus, 

15 Glycine max, Beta vulgaris, Lotus japonicus, Zinnia elegans, Zea mays, Cicer 

arietinum, Arabidopsis thaliana, Hordeum vulgare, Nicotiana tabacum, Gossypium 
hirsutum, Physcomitrella patens, Fucus distichus, Medicago truncutula, Homo sapiens, 
Caenorhabditits elegans, Tigriopus japonicus, Rhopalosiphum padi, Mus musculus, 
Discopyge ommata, Canis lupus, Drosophila melanogaster, Anopheles gambiae, 

20 Aplysia California, Ciona savignyi, Ciona intestinalis, Hemicentrotus pulcherrimus, 

Giardia lamblia, Gallus gallus, Brachydanio rerio, Xenopus laevis, Xenopus tropicalis, 
Schistosoma japonicum, Schistosoma mansoni, Encephalitozoon cuniculi, Wuchereria 
bancrofti, Cavia porcellus, Sus scrofa, Rattus norvegicus, Pneumocystis carinii or 
Pagrus major proteins as depicted SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 

25 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 

30 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 
256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 
290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 
324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 
358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 

35 392 or 394 or its homologs is increased, preferably, in one embodment an increase of 
the fine chemical of at least 100%, 150%, or preferably to at least 200%, 250%, 300%, 
350% or 400%, especially prerferably to at least 450%, 500%, 550%, 600% or more is 
conferred. 

[0059.0.0.0] In case the biological activity of the Saccaromyces cerevisiae protein 
40 YNL090W as depicted in SEQ ID NO: 2, or the biological activity of the Kluyveromyces 
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lactis, Cryptococcus neoformans, Neurospora crassa, Penicillium marneffei, Mucor 
rouxii', Schizophyllum commune, Paracoccidioides brasiliensis, Aspergillus fumigatus, 
Suillus bovinus, Candida albicans, Trichoderma reesei, Ashbya gossypii, Yarrowia 
lipolytica, Ustilago maydis, Emericella nidulans, Trichomonas vaginalis, Colletotrichum 
5 trifolii, Blumeria graminis, Dictyostelium discoideum. Schizosaccharomyces pombe, 
Entamoeba histolytica, Oryza sativa. Brassica.napus, Glycine max, Beta vulgaris, 
Lotus japonicus, Zinnia elegans, Zea mays, Cicer arietinum, Arabidopsis thaliana, 
Hordeum vulgare, Nicotiana tabacum, Gossypium hirsutum, Physcomitrella patens, 
Fucus distichus, Medicago truncutula, Homo sapiens, Caenorhabditits elegans, 
1 o Tigriopus japonicus, Rhopalosiphum padi, Mus musculus, Discopyge ommata, Canis 
lupus, Drosophila melanogaster, Anopheles gambiae, Aplysia California, Ciona 
savignyi, Ciona intestinalis, Hemicentrotus pulcherrimus, Giardia lamblia, Gallus gallus, 
Brachydanio rerio, Xenopus laevis, Xenopus tropicalis, Schistosoma japonicum, 
Schistosoma mansoni, Encephalitozoon cuniculi, Wuchereria bancrofti, Cavia 
15 porcellus, Sus scrofa, Rattus norvegicus, Pneumocystis carinii or Pagrus major 

proteins as depicted SEQ ID NO: 4, 6, 8, 10, 12, 14, 16,. 18. 20, 22, 24, 26, 28, 30, 32, 
34 36 38 40, 42, 44, 46, 56, 58, 60. 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 
88 9o' 92. 94. 96. 98, 100. 102, 104, 106, 108, 110, 112. 114. 116, 118, 120, 122. 124, 
126 128 130. 132. 134. 136. 138. 140. 142, 144, 146. 148. 150, 152. 154. 156, 158, 
20 160* 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 
194, 196, 198, 200, 202, 204. 206, 208. 210, 212, 214, 216. 218. 220. 222. 224, 226, 
228 230 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 
262 264', 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 
296', 298, 300, 302, 304, 306, 308, 310, 312, 314. 316, 318, 320, 322, 324, 326, 328, 
25 330 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360. 362, 
364, 366. 368, 370. 372. 374. 376. 378. 380. 382. 384, 386, 388, 390, 392 or 394 or its 
homologs is increased, preferably, an increase of the fine chemical in conferred, 
preferably of the fine chemical such as essential amino acids e.g. tryptophane, 
arginine, phenylalanine, tyrosine, threonine, valine, isoleucine and/or leucine, non- 
30 essential amino acids e.g. proline, alanine, glycine or serine, modified amino acids e.g. 
3.4-dihydroxyphenylalanine, carbohydrates e.g. raffinose, inositol or iso-maltose, 
vitamins e.g. a-tocopherol, B-tocopherol or v-tocopherol, organic acids e.g. ferulic acid, 
sinapic acid or malate, fatty acids e.g. cerotic acid, lignoceric acid. 2-hydroxy-palmitic 
acid or stearic acid, carotinoids e.g. B-carotene or mixtures thereof is conferred. 

35 [0060.0.0.0] In this context, the fine chemical amount in a cell, preferably in a tissue, 
more preferred in a organism as a plant or a microorganim or part thereof, is increased 
by at least 3%, 4%, 5%, 6%. 7%, 8% or 9% or more, especially preferably are at least 
10%, 20%, 40%. 50% or more, very especially preferably are more than 60%, 70%, 
80%', 90%, 100% or more and most preferably are 150% or more, such as 200%, 

40 250%, 300%, 350%, 400%, 450%, 500%, 550% or 600%. 
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[0061 .0.0.0] The fine chemical can be contained in the organism either in its free 
form and/or bound to proteins, polypeptids or other compounds such as 
polysaccharides, lipids, glycoproteins or glycolipids etc. or mixtures thereof. 
Accordingly, in one embodiment, the amount of the free form in a cell, preferably in a 
5 tissue, more preferred in a organism as a plant or a microorganim or part thereof, is 
increased by at least 3%, 4%, 5%, 6%, 7%, 8% or 9% or more, especially preferably 
are at least 1 0%, 20%, 40%, 50% or more, very especially preferably are more than 
60%, 70%, 80%, 90%, 100% or more and most preferably are 150% or more, such as 
200%, 250%, 300%, 350%, 400%, 450%, 500%, 550% or 600%, Accordingly, in an 

10 other embodiment, the amount of the bound the fine chemical in a cell, preferably in a 
tissue, more preferred in a organism as a plant or a microorganism or part thereof, is 
increased by at least 3%, 4%, 5%, 6%, 7%, 8% or 9% or more, especially preferably 
are at least 10%, 20%, 40%, 50% or more, very especially preferably are more than 
60%, 70%, 80%, 90%, 100% Or more and most preferably are 150% or more, such as 

15 200%, 250%, 300%, 350%, 400%, 450%, 500%, 550% or 600%. 

[0062.0.0.0] A protein having an activity conferring an increase in the amount or level 
of the fine chemical preferably has the structure of the polypeptide described herein, in 
particular of the polypeptides comprising the consensus sequence shown in SEQ ID 
NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID 

20 NO: 52, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 399 and/or SEQ ID NO: 400 
as described herein, or is encoded by the nucleic acid molecule characterized herein or 
the nucleic acid molecule according to the invention, for example by the nucleic acid 
molecule as shown in SEQ ID NO: 1, 3. 5, 7, 9, 11. 13, 15, 17, 19, 21, 23, 25, 27, 29, 
31 , 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61 . 63, 65, 67, 69, 71 , 73, 75, 77, 79, 81 , 83, 

25 85, 87, 89, 91, 93, 95,97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 
123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163, 165. 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 
191, 193. 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 
225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 

30 259, 261, 263, 265, 267. 269. 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 
293, 295. 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 
327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359. 
361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391or 393 
or its herein described functional homologues and has the herein mentioned activity. 

35 [0063.0.0.0] For the purposes of the present invention, the terms "the fine chemical" 
or "fine chemical" such as essential amino acids e.g. tryptophane, arginine, 
phenylalanine, tyrosine, threonine, valine, isoleucine and/or leucine, non-essential 
amino acids e.g. proline, alanine, glycine or serine, modified amino acids e.g. 3,4- 
dihydroxyphenylalanine, carbohydrates e.g. raffinose, inositol or iso-maltose, vitamins 

40 e.g. a-tocopherol, 8-tocopherol or y-tocopherol, organic acids e.g. ferulic acid, sinapic 
acid or malate, fatty acids e.g. cerotic acid, lignoceric acid, 2-hydroxy-palmitic acid or 
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stearic acid, carotinoids e.g. B-carotene or mixtures thereof also encompass the 
corresponding salts, such as, for example, tryptophane hydrochloride, arginine 
hydrochloride, phenylalanine hydrochloride, tyrosine hydrochloride, threonine 
hydrochloride, valine hydrochloride, isoleucine hydrochloride or leucine hydrochloride 
5 or tryptophane sulfate, arginine sulfate, phenylalanine sulfate, tyrosine sulfate, 
' threonine sunate, valine suuaie, isuieuuiue »unau? ui ibuwi.s — 

[0064.0.0.0] Owing to the biological activity of the proteins which are used in the 
process according to the invention and which are encoded by nucleic acid molecules 
according to the invention, it is possible to produce compositions comprising the fine 
1 o chemical, i.e. an increased amount of the free chemical free or bound, e.g amino acid 
compositions. Depending on the choice of the organism used for the process according 
to the present invention, for example a microorganism or a plant, compositions or 
mixtures of various fine chemicals e.g. amino acids can be produced. 

[0065.0.0.0] The term "expression" refers to the transcription and/or translation of a 
1 5 codogenic gene segment or gene. As a rule, the resulting product is an mRNA or a 
protein. However, expression products can also include functional RNAs such as, for 
example, antisense, nucleic acids, tRNAs, snRNAs, rRNAs, RNAi, siRNA, ribozymes 
etc. Expression may be systemic, local or temporal, for example limited to certain cell 
types, tissuesorgans or time periods. 

20 [0066.0.0.0] In one embodiment, the process of the present invention comprises one 
or more of the following steps: 

a) stabilizing a protein conferring the increased expression of a protein encoded by 
the nucleic acid molecule of the invention or of the protein of the invention, e.g. of 
a protein having the biological activity represented by a protein as depicted in 
25 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20. 22, 24, 26, 28, 30, 32, 34, 36, 38, 

40. 42. 44. 46. 56. 58. 60. 62, 64, 66, 68, 70. 72. 74, 76, 78, 80, 82, 84, 86, 88. 
90, 92, 94, 96. 98, 100. 102. 104. 106, 108, 110, 112, 114, 116, 118, 120, 122, 
124. 126, 128, 130. 132. 134, 136, 138. 140. 142. 144, 146, 148, 150, 152. 154. 
156, 158. 160. 162, 164. 166, 168, 170, 172, 174. 176. 178, 180. 182, 184, 186, 

30 188. 190, 192, 194, 196. 198. 200. 202, 204. 206, 208, 210, 212. 214, 216, 218, 

220. 222, 224. 226. 228. 230. 232. 234, 236, 238. 240. 242, 244. 246, 248. 250. 
252, 254, 256, 258. 260, 262, 264, 266, 268. 270. 272. 274. 276, 278, 280, 282, 
284, 286. 288, 290, 292, 294, 296, 298, 300, 302. 304. 306, 308. 310, 312. 314. 
31 6'. 318*. 320, 322. 324. 326. 328. 330. 332, 334. 336, 338, 340. 342, 344, 346. 

35 348, 350, 352, 354. 356, 358. 360, 362. 364, 366, 368. 370, 372, 374. 376. 378. 

380, 382, 384, 386, 388, 390. 392 or 394 or its homologs having the fine 
chemical increasing activity; 
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b) stabilizing a mRNA conferring the increased expression of a protein encoded by 
the nucleic acid molecule of the invention, e.g. of a protein having the biological 
activity represented by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58. 60, 62, 64, 

5 66, 68, 70, 72, 74, 76. 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 

106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 
138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162. 164. 166, 168, 
170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198.200, 
202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 

10 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260. 262. 264. 

266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294. 296. 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326. 328. 
330, 332, 334, 336, 338, 340, 342, 344, 346, 348. 350, 352, 354, 356, 358, 360, 
362, 364, 366, 368. 370, 372, 374, 376, 378, 380, 382. 384. 386, 388, 390, 392 

15 or 394 or its homologs or of an mRNA encoding the polypeptide of the present 

invention having the fine chemical increasing activity; 

c) increasing the specific activity of a protein conferring the increased expression of 
a protein encoded by the nucleic acid moleciile of the invention or of the protein 
of the invention having the fine chemical increasing activity, e.g. of a protein 

20 having the biological activity represented by a protein as depicted in SEQ ID NO: 

2, 4, 6. 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42. 44, 46, 
56, 58, 60. 62, 64, 66, 68, 70, 72, 74. 76, 78, 80, 82. 84, 86, 88, 90. 92. 94, 96, 
98, 100, 102. 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128. 
130, 132, 134, 136, 138, 140, 142, 144, 146. 148. 150. 152, 154. 156, 158, 160, 

25 162, 164, 166, 168. 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 

194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 
226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278. 280, 282, 284, 286, 288, 
290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 

30 322, 324, 326, 328, 330, 332, 334, 336. 338, 340, 342, 344, 346, 348, 350, 352, 

354, 356, 358, 360, 362, 364, 366, 368. 370, 372, 374, 376, 378, 380, 382, 384, 
386, 388, 390, 392 or 394 or its homologs, or decreasing the inhibition/ 
regulation of the protein of the invention; 

d) generating or increasing the expression of an endogenous or artificial 

35 transcription factor mediating the expression of a protein conferring the increased 

expression of a protein encoded by the nucleic acid molecule of the invention or 
of the protein of the invention having the fine chemical increasing activity, e.g. of 
a polypeptide having the biological activity represented by a protein as depicted 
in SEQ ID NO: 2, 4, 6, 8. 10, 12, 14, 16, 18, 20, 22. 24. 26, 28, 30, 32, 34, 36, 

40 38, 40. 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 

88, 90. 92. 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 
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122 124 126. 128. 130. 132. 134. 136. 138, 140, 142. 144, 146. 148. 150. 152, 
154'. 156', 158, 160. 162, 164, 166, 168, 170, 172. 174. 176. 178. 180. 182. 184, 
186, 188, 190, 192. 194, 196, 198, 200, 202, 204. 206, 208. 210, 212, 214. 216. 
218 220, 222, 224, 226, 228, 230. 232. 234, 236. 238. 240, 242, 244. 246. 248. 
250'. 252, 254, 256, 258. 260, 262, 264, 266, 268. 270. 272. 274, 276. 278. 280. 
000 inn 000 OQ/l -5QR oqr 300 302.304.306.308.310,312, 

ZOZ, ZO*f, ^OD, iOO, tw, Wt, £.*>-r, . -, , . - 

314 316, 318, 320. 322. 324, 326, 328. 330, 332, 334, 336, 338. 340. 342. 344. 
346*. 348, 350, 352, 354, 356, 358, 360, 362, 364. 366. 368, 370, 372. 374. 376, 
378, 380, 382, 384, 386, 388, 390, 392 or 394 or its homologs; 

- e) stimulating activity, of a protein conferring the. increased expression of a protein 
encoded by the nucleic acid molecule of the present invention or a protein of the 
present invention having the fine chemical increasing activity, e.g. of a protein 
having the biological activity represented by a protein as depicted in SEQ ID NO: 
2 4 6 8 10. 12. 14. 16, 18. 20. 22, 24, 26. 28. 30. 32. 34, 36. 38, 40, 42. 44, 46, 
56 58 60, 62. 64. 66. 68. 70, 72. 74, 76. 78, 80. 82. 84. 86, 88, 90, 92. 94. 96. 
98 100. 102. 104. 106. 108. 110, 112, 114, 116, 118. 120. 122. 124. 126. 128. 
130 132 134. 136. 138. 140, 142, 144, 146, 148. 150. 152. 154. 156, 158.160, 
162 164 166, 168. 170, 172, 174, 176, 178, 180. 182. 184. 186, 188, 190, 192, 
194 196', 198, 200, 202. 204. 206. 208. 210. 212. 214. 216, 218. 220. 222. 224, 
226 228, 230, 232. 234. 236. 238, 240, 242, 244, 246. 248. 250, 252. 254, 256. 
258 260 262, 264, 266, 268, 270, 272, 274, 276, 278. 280. 282, 284, 286. 288, 
290* 292 294, 296, 298. 300, 302, 304. 306, 308, 310, 312. 314. 316, 318, 320. 
322 324 326, 328. 330, 332, 334, 336, 338, 340, 342. 344. 346. 348, 350, 352. 
354*. 356. 358. 360, 362, 364, 366, 368, 370, 372, 374. 376. 378. 380. 382. 384, 
386', 388. 390, 392 or 394 or its homologs, by adding one or more exogenous 
inducing factors to the organismus or parts thereof; 

f) expressing a transgenic gene encoding a protein conferring the increased 
' expression of a polypeptide (= protein) encoded by the nucleic acid molecule or 
the protein of the invention, having the fine chemical increasing activity, e.g. of a 
I protein having the biological activity represented by a protein as depicted in SEQ 

ID NO: 2, 4, 6, 8, 10, 12. 14, 16, 18, 20, 22, 24, 26. 28, 30. 32, 34, 36, 38, 40, 42. 
44 46 56 58, 60, 62. 64. 66. 68. 70, 72, 74, 76, 78. 80. 82, 84, 86. 88, 90. 92. 
94 96, 98, 100. 102, 104, 106, 108, 110, 112, 114, 116. 118, 120. 122, 124, 126, 
128 130 132, 134, 136, 138, 140, 142, 144, 146, 148, 150. 152. 154, 156, 158. 
5 160* 162*. 164, 166, 168, 170. 172. 174. 176. 178, 180, 182. 184. 186, 188. 190. 

192 194 196, 198, 200, 202, 204, 206. 208, 210, 212, 214, 216. 218, 220, 222, 
224 226* 228. 230, 232, 234. 236, 238. 240, 242, 244, 246. 248, 250, 252, 254. 
256 258'. 260, 262, 264. 266. 268, 270, 272, 274. 276. 278. 280. 282, 284, 286. 
288 290 292, 294, 296. 298. 300, 302, 304, 306, 308, 310, 312. 314, 316, 318, 
0 320', 322*. 324, 326, 328, 330, 332, 334, 336, 338, 340, 342. 344. 346, 348, 350, 
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352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374. 376, 378, 380, 382, 
384, 386, 388, 390, 392 or 394 or its homologs; 

g) Increasing the copy number of a gene conferring the increased expression of a 
nucleic acid molecule encoding a protein encoded by the nucleic acid molecule 

5 of the invention or the protein of the invention the fine chemical increasing 

activity, e.g. of a protein having the biological activity represented by a protein as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 
84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 

10 120, 122, 124, 126, 128. 130. 132. 134, 136, 138, 140, 142, 144, 146, 148, 150, 

152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 
184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 
216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242. 244. 246, 
248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268. 270, 272, 274, 276. 278, 

1 5 280. 282, 284, 286, 288, 290. 292, 294, 296, 298, 300, 302.. 304, 306, 308, 31 0, 

312. 314, 316, 318, 320, 322, 324, 326. 328. 330. 332, 334, 336, 338, 340, 342, 
344, 346, 348, 350, 352, 354, 356, 358, 360, 362. 364, 366, 368, 370, 372, 374, 
376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 or its homologs; 

h) increasing the expression of the endogenous gene encoding the protein of the 
20 invention, e.g. a protein having the biological activity represented by a protein as 

depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42. 44, 46, 56. 58, 60, 62, 64, 66, 68, 70, 72, 74. 76. 78, 80, 82, 
84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 

25 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178. 180. 182. 

184, 186. 188, 190, 192. 194, 196. 198. 200, 202, 204, 206, 208, 210, 212, 214, 
216, 218, 220. 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 
248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286. 288. 290. 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 

30 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 

344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 
376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 or its homologs, by adding 
positive expression or removing negative expression elements, e.g. homologous 
recombination can be used to either introduce positive regulatory elements like 

35 for plants the 35S enhancer into the promoter or to remove repressor elements 

form regulatory regions. Further gene conversion methods can be used to disrupt 
repressor elements or to enhance to acitivty of positive elements. Positive 
elements can be randomly introduced in plants by T-DNA or transposon 
mutagenesis and lines can be identified in which the positive elements have be 

40 integrated near to a gene of the invention, the expression of which is thereby 

enhanced; 
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i) modulating growth conditions of an organism in such a manner, that the 

expression or activity of the gene encoding the protein of the invention or the 
protein itself is enhanced for example microorganisms or plants can be grown 
under a higher temperature regime leading to an enhanced expression of heat 
5 shock proteins, e.g. the heat shock protein of the invention, which can lead an 

fine oh^mioal nroriuction: and/or 

CI II IU I VI IW HI iw ■ — • . |- • ' • 

j) selecting of organisms with expecially high activity of the protein of the invention 
from natural or from mutagenized resources and breeding them into the target 
organisms, eg the elite crops. 

10 [0067.0.0.6] Preferably, said mRNA is the nucleic acid molecule of the invention 
and/or the protein conferring the increased expression of a protein encoded by the 
nucleic acid molecule of the invention or the polypeptide having the herein mentioned 
activity, e.g. conferring the increase of the fine chemical after increasing the expression 
or activity of the encoded polypeptide or having the activity of a polypeptide having 
15 biological activity represented by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 
14 16, 18. 20, 22, 24, 26. 28. 30, 32, 34, 36, 38. 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 
68 70 72 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 
110 112, 114, 116, 118, 120, 122, 124, 126, 128. 130. 132. 134, 136, 138, 140, 142, 
144 146 148, 150, 152, 154. 156. 158, 160, 162, 164, 166, 168, 170, 172. 174, 176, 
20 178 180, 182. 184. 186. 188. 190. 192, 194, 196. 198. 200, 202, 204, 206, 208, 210, 
212 214. 216. 218. 220. 222, 224, 226, 228, 230. 232. 234, 236, 238, 240, 242, 244, 
246* 248*. 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280* 282. 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
314 316', 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 
25 348*. 350, 352, 354, 356, 358, 360, 362. 364, 366, 368, 370, 372, 374, 376, 378, 380, 
382, 384, 386, 388, 390, 392 or 394 or its homologs. 

[0068.0.0.0] In general, the amount of mRNA, polynucleotide or nucleic acid 
molecule in a cell or a compartment of an organism correlates with the amount of 
encoded protein and thus with the overall activity of the encoded protein in said 
30 volume. Said correlation is not always linear, the activity in the volume is dependent on 
the stability of the molecules, the degradation of the molecules or the presence of 
activating or inhibiting co-factors. Further, product and educt inhibitions of enzymes are 
well known and described in textbooks, e.g. Stryer, Biochemistry or Zinser et al. 
"Enzyminhibitoren7Enzyme inhibitors". 

35 [0069.0.0.0] The activity of the abovementioned proteins and/or poylpeptide encoded 
by the nucleic acid molecule of the present invention can be increased in various ways. 
For example, the activity in an organism or in a part thereof, like a cell, is increased via 
increasing the gene product number, e.g. by increasing the expression rate, like 
introducing a stronger promoter, or by increasing the stability of the mRNA expressed, 
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thus increasing the translation rate, and/or increasing the stability of the gene product, 
thus reducing the proteins decayed. Further, the activity or turnover of enzymes can be 
influenced in such a way that a reduction or increase of the reaction rate or a 
modification (reduction or increase) of the affinity to the substrate results, is reached. A 
5 mutation in the catalytic centre of a polypeptide of the invention, e.g. an enzyme, can 
modulate the turn over rate of the enzyme, e.g. a knock out of an essential amino acid 
can lead to a reduced or completely knock out activity of the enzyme, or the deletion or 
mutation of regulator binding sites can reduce a negative regulation like a feedback 
inhibition (or a substrate inhibition, if the substrate level is also increased). The specific 
10 activity of an enzyme of the present invention can be increased such that the turn over 
rate is increased or the binding of a eo-factor is improved. Improving the stability of the 
encoding mRNA or the protein can also increase the activity of a gene product. The 
stimulation of the activity is also under the scope of the term "increased activity". 

[0070.0.0.0] Moreover, the regulation of the abovementioned nucleic add sequences 
15 may be modified so that gene expression is increased. This can be achieved 

advantageously by means of heterologous regulatory sequences or by modifying, for 
example mutating, the natural regulatory sequences which are present The 
advantageous methods may 'also be combined with each other. 

[0071 .0.0.0] In general, an activity of a gene product in an organism or part thereof, 
20 in particular in a plant cell, a plant, or a plant tissue or a part thereof or in a 

microorganism can be increased by increasing the amount of the specific encoding 
mRNA or the corresponding protein in said organism or part thereof. "Amount of protein 
or mRNA" is understood as meaning the molecule number of polypeptides or mRNA 
molecules in an organism, a tissue, a cell or a cell compartment. "Increase" in the 
25 amount of a protein means the quantitative increase of the molecule number of said 
protein in an organism, a tissue, a cell or a cell compartment or part thereof - for 
example by one of the methods described herein below - in comparison to a wild type, 
control or reference. 

[0072.0.0.0] The increase in molecule number amounts preferably to at least 1 %, 
30 preferably to more than 10%, more preferably to 30% or more, especially preferably to 
50%, 70% or more, very especially preferably to 100%, most preferably to 500% or 
more. However, a de novo expression is also regarded as subject of the present 
invention. 

[0073.0.0.0] A modification, i.e. an increase or decrease, can be caused by 
35 endogenous or exogenous factors. For example, an increase in activity in an organism 
or a part thereof can be caused by adding a gene product or a precursor or an activator 
or an agonist to the media or nutrition or can be caused by introducing said subjects 
into a organism, transient or stable. 
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[0074.0.0.0] In one embodiment the increase in the amount of the fine chemical in 
the organism or a part thereof, e.g. in a cell, a tissue, an organ, an organelle etc., is 
achived by increasing the endogenous level of the polypeptide of the invention. 
Accordingly, in an embodiment of the present invention, the present invention relates to 
5 a process wherein the gene copy number of a gene encoding the polynucleotide or 

5 , • j.: : Cur+lior tha enrlrmpnni IS level of 

nucleic acia moiecuie of me niveimun ■» mucaoSu. • «■ « >~> . «• o — — 

the polypeptide of the invention can for example be increased by modifiying the 
transcriptional or translational regulation of the polypeptide. 

[0075.0.0.0] In one embodiment the amount of the fine chemical in the organism or 
1 0 part thereof can be increase by targeted or random mutagenesis of the endogenous 
genes of the invention. For example homologous recombination can be used to either 
introduce positive regulatory elements like for plants the 35S enhancer into the 
promoter or to remove repressor elements form regulatory regions. In addition gene 
conversion like methods described by Kochevenko and Willmitzer (Plant Physiol. 2003 
15 May;132(1):174-84) and citations therein can be used to disrupt repressor elements or 
to enhance to acitivty of positive regulatory elements. 

Furthermore positive elements can be randomly introduced in (plant) genomes by T- 
DNA or transposon mutagenesis and lines can be screened for, in which the positive 
elements has be integrated near to a gene of the invention, the expression of which is 

20 thereby enhanced. The activation of plant genes by random integrations of enhancer 
elements has been described by Hayashi et aL, 1992 (Science 258:1350-1353) or 
Weigel et aL, 2000 (Plant Physiol. 122, 1003-1013) and others citated therein. 
Reverse genetic strategies to identify insertions (which eventually carrying the 
activation elements) near in genes of interest have been described for various cases 

25 eg. Krysan et aL, 1999 (Plant Cell 1999, 11, 2283-2290); Sessions et al.. 2002 (Plant 
Cell 2002, 14, 2985-2994); Young etal., 2001, (Plant Physiol. 2001, 125, 513-518); 
Koprek et aL. 2000 (Plant J. 2000, 24, 253-263) ; Jeon et aL, 2000 (Plant J. 2000, 22, 
561-570) ; Tissier et al.. 1999 (Plant Cell 1999, 11, 1841-1852); Speulmann etal., 1999 
(Plant Cell 1999 ,11 , 1853-1866). Briefly material from all plants of a large T-DNA or 

30 transposon mutagenized plant population is harvested and genomic DNA prepared. 
Then the genomic DNA is pooled following specific architectures as described for 
example in Krysan et al., 1999 (Plant Cell 1999. 11, 2283-2290). Pools of genomics 
DNAs are then screened by specific multiplex PCR reactions detecting the combination 
of the insertions mutagen (eg T-DNA or Transposon) and the gene of interest. 

35 Therefore PCR reactions are run on the DNA pools with specific combinations of T- 

DNA or transposon border primers and gene specific primers. General rules for primer 
design can again be taken from Krysan et al.. 1999 (Plant Cell 1999, 11, 2283-2290) 
Rescreening of lower levels DNA pools lead to the identifcation of individual plants in 
which the gene of interest is disrupted by the insertional mutagen. 

40 The enhancement of positive regulatory elements or the disruption or weaking of 
negative regulatory elements can also be achieved through common mutagenesis 
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techniques: The production of chemically or radiation mutated populations is a common 
technique and known to the skilled worker. Methods for plants are described by 
Koomeef et al. 1982 and the citations therein and by Lightner and Caspar in "Methods 
in Molecular Biology" Vol 82. These techniques usually induce pointmutations that can 
5 be identified in any known gene using methods such as TILLING (Colbert et al. 2001). 

[0076.0.0.0] Accordingly, the expression level can be increased if the endogenous 
genes encoding a polypeptide conferring an increased expression of the polypeptide of 
the present invention, in particular genes comprising the nucleic acid molecule of the 
present invention, are modified via homologous recombination, Tilling approaches or 
.10 gene conversion . . . - 

[0077.0.0.0] Regulatory sequences can be operatively linked to the coding region of 
an endogenous protein and control its transcription and translation or the stability or 
decay of the encoding mRNA or the expressed protein. In order to modify and control 
the expression, promoter, UTRs, splicing sites, processing signals, polyadenylation 

15 sites, terminators, enhancers, repressors, post transcriptional or posttranslational 
modification sites can be changed, added or amended for example, the activation of 
plant genes by random integrations of enhancer elements has been described by 
Hayashi et al., 1992 (Science 258:1350-1353) or Weigel et al., 2000 (Plant Physiol. 
122, 1003-1013) and others citated therein. For example, the expression level of the 

20 endogenous protein can be modulated by replacing the endogenous promoter with a 
stronger transgenic promoter or by replacing the endogenous 3'UTR with a 3'UTR, 
which provides more stablitiy without amending the coding region. Further, the 
transcriptional regulation can be modulated by introduction of an artifical transcription 
factor as described in the examples. Alternative promoters, terminators and UTR are 

25 described below. 

[0078.0.0.0] The activation of an endogenous polypeptide having the fine chemical 
increasing activity, e.g.having the biological activity represented by a protein as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 

30 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 

35 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366, 368, 370, 372. 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394, e.g. 
conferring the increase of the fine chemical after increase of expression or activity can 

40 also be increased by introducing a synthetic transcription factor, which binds close to 
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the coding region of the protein of the invention encoding gene and activates its 
transcription. A chimeric zinc finger protein can be construed, which comprises a 
specific DNA-binding domain and an activation domain as e.g. the VP1 6 domain of 
Herpes Simplex virus. The specific binding domain can bind to the regulatory region of 
5 the nucleic acid sequence used in the inventive process. The expression of the 

chimeric transcription factor in an organism, in particular in a plant, leads to a specific 
expression of the protein of the invention, see e.g. in WO01/52620, Oriz, Proc. Natl. 
Acad. Sci. USA, 2002, Vol. 99, 13290 or Guan, Proc. Natl. Acad. Sci. USA 2002, Vol. 
99, 13296. 

1 0 [0079.0.0.0] In one further embodiment of the process according to the invention, 
organisms are used in which one of the abovementioned genes, or one of the 
abovementioned nucleic acids, is mutated in a way that the activity of the encoded 
gene products is less influenced by cellular factors, or not at all. in comparison with the 
unmutated proteins. For example, well known regulation mechanism of enzymic activity 

1 5 are substrate inhibition or feed back regulation mechanisms. Ways and techniques for 
the introduction of substitutions, deletions and additions of one or more bases, 
nucleotides or amino acids of a corresponding sequence are described herein below in 
the corresponding paragraphs and the references listed there, e.g. in Sambrook et al M 
Molecular Cloning, Cold Spring Habour, NY, 1989. The person skilled in the art will be 

20 able to identify regulation domains and binding sites of regulators by comparing the 
sequence of the nucleic acid molecule of the present invention or the expression 
product thereof with the state of the art by computer software means which comprise 
algorithms for the identifying of binding sites and regulation domains or by introducing 
into a nucleic acid molecule or in a protein systematically mutations and assaying for 

25 those mutations which will lead to an increased specifiy activity or an increased activity 
per volume, in particular per cell. 

[0080.0.0.0] It is therefore adavantageously to express in an organism a nucleic acid 
molecule of the invention or a polypeptide of the invention derived from a evolutionary 
distantly related organism, as e.g. using a prokaryotic gene in. a eukaryotic host, as in 
30 these cases the regulation mechanism of the host cell may not weaken the activity 
(cellular or specific) of the gene or its expression product 

[0081.0.0.0] The mutation is introduced in such a way that the production of the fine 
chemical is not adversely affected. 

[0082.0.0.0] Less influence on the regulation of a gene or its gene product is 
35 understood as meaning a reduced regulation of the enzymatic activity leading to an 
increased specific or cellular activity of the gene or its product. An increase of the 
enzymatic activity is understood as meaning an enzymatic activity, which is increased 
by at least 10%, 20%, 30%, 40% or 50%, advantageously by at least 60%, 70%, 80%, 
90% or 100%, especially advantageously by at least 150%, 200%, 300% or more in 
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comparison with the starting organism. In the event the inventive nucleic acid 
sequences were introduced into an organism, which did not have the encoded protein 
activity, said new generated enzymatic activity shall also be embraced by the herein 
described invention. This leads to an increased productivity of the desired fine 
5 chemical. 

[0083.0.0.0] Owing to the introduction of a gene or a plurality of genes conferring the 
expression of the nucleic acid molecule of the invention or the polypeptide of the 
invention as described below, for example the nucleic acid construct mentioned below, 
or e.g. encoding the protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 

10 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40,-42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 
182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 

15 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 
250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 
284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 
318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 
352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 

20 386, 388, 390, 392 or 394 into an organism alone or in combination with other genes, it 
is possible not only to increase the biosynthetic flux towards the end product, but also 
to increase, modify or create de novo an advantageous, preferably novel metabolites 
composition in the organism, e.g. an advantageous amino acid composition comprising 
a higher content of (from a viewpoint of nutrional physiology limited) amino acids, for 

25 example essential amino acids like tryptophane, threonine, methionine or lysine. 

[0084.0.0.0] Preferably the composition comprises further higher amounts of 
metabolites positively affecting or lower amounts of metabolites negatively affecting the 
nutrition or health of animals or humans provided with said compositions or organisms 
of the invention or parts thereof. Likewise, the number or activity of further genes which 

30 are required for the import or export of nutrients or metabolites, including for example 
amino acids or its precursors, required for the cell's biosynthesis of the fine chemical 
may be increased so that the concentration of necessary or relevant precursors, 
cofactors or intermediates within the cell(s) or within the corresponding storage 
compartments is increased. Owing to the increased or novel generated activity of the 

35 polypeptide of the invention or owing to the increased number of nucleic acid 
sequences of the invention and/or to the modulation of further genes which are 
involved in the biosynthesis of the fine chemical, e.g. by increasing the acitivty of 
enzymes synthizing precursors or by destroying the activity of one or more genes 
which are involved in the breakdown of the fine chemical, it is thereby possible to 

40 increase the yield, production and/or production efficiency of the fine chemical in the 
host organism, such as plants or microorganims. 
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[0085.0.0.0] Accordingly, in one embodiment, the process according to the invention 
relates to a process, which comprises: 

a) providing a non-human organism, preferably a microorganism, non-human 
animal, plant or part, cell or tissue thereof; 

5 b) increasing the biological activity represented by a protein as depicted in 3EQ ID 
NO: 2, 4, 6, 8. 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36. 38, 40, 42, 
44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72. 74. 76, 78, 80. 82, 84, 86, 88, 90, 92, 
94 96, 98, 100, 102. 104. 106. 108. 110. 112, 114. 116, 118. 120, 122. 124. 126. 
128, 130. 132. 134, 136. 138. 140. 142, 144. 146. 148, 150, 152, 154, 156. 158. 
10 160, 162. 164, 166. 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188. 190. 

192 194, 196. 198. 200, 202. 204, 206, 208. 210. 212, 214. 216. 218, 220, 222, 
224 226. 228, 230, 232, 234, 236, 238. 240, 242, 244. 246, 248, 250, 252, 254. 
256, 258, 260. 262, 264. 266. 268. 270. 272. 274. 276. 278. 280. 282. 284. 286, 
288 290, 292. 294, 296, 298, 300. 302, 304. 306, 308. 310. 312. 314, 316. 318, 
15 320*. 322. 324, 326. 328. 330. 332, 334. 336. 338, 340. 342. 344. 346, 348. 350. 

352. 354. 356, 358, 360, 362, 364, 366, 368, 370. 372, 374. 376, 378, 380, 382, 
384*, 386, 388, 390, 392 or 394 or of a polypeptide being encoded by the nucleic 
acid molecule of the invention and described below, i.e. conferring an increase of 
the fine chemical in the organism, preferably in the microorganism, the non- 
20 human animal, the plant or part, cell or tissue thereof, 

c) growing the organism, preferably the microorganism, the non-human animal, the 
plant or part, cell or tissue thereof under conditions which permit the production 
of the fine chemical; and 

d) if desired, revovering. optionally isolating, the free and/or bound fine chemical. 

25 [0086.0.0.0] The organism, in particular the microorganism, non-human animal, the 
plant or animal parts, the plant or animal cell, the plant or animal tissue or the plant is 
advantageously grown in such a way that it is not only possible to recover, if des.red 
isolate the free or bound fine chemical (Galili et al.. Transgenic Res., 2000, 9. 2. 137- 
144). 

30 [0087.0.0.0] After the above-described increasing (which as defined above also 

encompasses the generating of an activity in an organism, i.e. a de novo activity), for 
example after the introduction and the expression of the nucleic acid molecules of the 
invention or described in the methods or processes according to the invention, the 
organism according to the invention, advantageously, a microorganism, a non-human 

35 animal, a plant, plant or animal tissue or plant or animal cell, is grown and 
subsequently harvested. 
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[0088.0.0.0] Suitable organisms or host organisms (transgenic organism) for the 
nucleic acid molecule used according to the invention and for the inventive process, the 
nucleic acid construct or the vector (both as described below) are, in principle, all 
organisms which are capable of synthesizing the fine chemical, and which are suitable 
5 for the activation, introduction or stimulation genes. Examples which may be mentioned 
are plants, microorganisms such as fungi, bacteria, yeasts, alga or diatom, transgenic 
or obtained by site directed mutagenesis or random mutagenesis combined with 
specific selection procedures. Preferred organisms are those which are naturally 
capable of synthesizing the fine chemical in substantial amounts, like fungi, yeasts, 
10 bactria or plants. In principle, transgenic animals, for example Caenorhabditis elegans, 
are alsb suitable as host organisms. 

[0089.0.0.0] In the event that the transgenic organism is a microorganism, such as a 
eukaryotic organism, for example a fungus, an alga, diatom or a yeast in particular a 
fungus, alga, diatom or yeast selected from the families Chaetomiaceae, 

15 Choanephoraceae, Cryptococcaceae, Cunninghamellaceae, Demetiaceae, 
Dipodascaceae, Moniliaceae, Mortierellaceae, Mucoraceae, Pythiaceae, 
Saccharomycetaceae, Saprolegniaceae, Schizosaccharomycetaceae, Sodariaceae, 
Sporobolomycetaceae Tuberculariaceae, Adelotheciaceae, Dinophyceae, Ditrichaceae 
or Prasinophyceae, or a fungus selected from the families Tremellaceae, 

20 Filobasidiaceae, Christianseniaceae, Cystofilobasidiaceae, Sordariaceae, 
Annulatascaceae, Cephalothecaceae, Chaetomiaceae, Coniochaetaceae, 
Lasiosphaeriaceae, Pleurotremataceae, Elaphomycetaceae, Trichocomaceae, 
Mucoraceae,_Schizophyllaceae, Onygenaceae, Suillaceae, Hypocreaceae, 
Ustilaginaceae, Trichocomaceae, Phyllachoraceae, Erysiphaceae or a prokaryotic 

25 organism, for example a bacterium or blue alga, in particular a bacterium from the 
families Actinomycetaceae, Bacillaceae, Brevibacteriaceae, Corynebacteriaceae, 
Cyanophyceae, Enterobacteriacae, Gordoniaceae, Nocardiaceae, Micrococcaceae, 
Mycobacteriaceae, Pseudomonaceae, Rhizobiaceae or Streptomycetaceae, this 
microorganism is grown on a solid or in a liquid medium which is known to the skilled 

30 worker and suits the organism. After the growing phase, the organisms can be 
harvested. 

[0090.0.0.0] The microorganisms or the recovered, and if desired isolated, fine 
chemical can then be processed further directly into foodstuffs or animal feeds or for 
other applications, for example according to the disclosures made in EP-B-0 533 039 

35 or EP-A-0 615 693, which are expressly incorporated herein by reference. The 

fermentation broth or fermentation products can be purified in the customary manner 
by extraction and precipitation or via ion exchangers and other methods known to the 
person skilled in the art and described herein below. Products of these different work- 
up procedures are fine chemical e.g. amino acids or amino acid compositions which 

40 still comprise fermentation broth and cell components in different amounts, 

advantageously in the range of from 0 to 99% by weight, preferably below 80% by 
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weight, especially preferably between below 50%, 40%, 30%, 20%, 10% or 5% by 
weight 

[0091 .0.0.0] Preferred microorganisms are selected from the group consisting of 
Chaetomiaceae such as the genera Chaetomium e.g. the species Chaetomidium 
5 fi ms ti; chcanephoraceae such as the genera Blakeslea, Choanephora e.g. the species 
Blakeslea trispora, Choanephora cucurbitarum or Choanephora infundibulifera var. 
cucurbitarum; Cryptococcaceae such as the genera Candida, Crytococcus, 
Rhodotorula, Torulopsis e.g. the species Candida albicans, Candida albomarginata, 
Candida antarctica, Candida bacarum, Candida bogoriensis, Candida boidinii, Candida 
10 bovina, Candida . brumptii, . Candida cacaoi, Candida cariosilignicola, Candida 

catenulata, Candida chalmersii, Candida ciferrii, Candida cylindracea, Candida edax, 
Candida ernobii, Candida famata, Candida freyschussii, Candida friedrichii, Candida 
glabrata, Candida guilliermondii, Candida haemulonii, Candida humicola, Candida 
inconspicua, Candida ingens, Candida intermedia, Candida kefyr, Candida krusei, 
15 Candida lactiscondensi, Candida lambica, Candida lipolytica, Candida lusitaniae, 
Candida macedoniensis, Candida magnoliae, Candida membranaefaciens, Candida 
mesenterica, Candida multigemmis, Candida mycoderma, Candida nemodendra, 
Candida nitratophila, Candida norvegensis, Candida norvegica, Candida parapsilosis, 
Candida pelliculosa, Candida peltate, Candida pint, Candida pseudotropicalis, Candida 
20 pulcherrima, Candida punicea, Candida pustula, Candida ravautii, Candida reukaufii, 
Candida rugosa, Candida sake, Candida silvicola, Candida solani, Candida sp., 
Candida spandovensis, Candida succiphila, Candida tropicalis, Candida utilis, Candida 
valida, Candida versatilis, Candida vini, Candida zeylanoides, Cryptococcus albidus, 
Cryptococcus curvatus, Cryptococcus flavus, Cryptococcus humicola, Cryptococcus 
25 hungaricus, Cryptococcus kuetzingii, Cryptococcus laurentii, Cryptococcus macerans, 
Cryptococcus neoformans, Cryptococcus terreus, Cryptococcus uniguttulatus, 
Rhodotorula acheniorum, Rhodotorula bacarum, Rhodotorula bogoriensis, Rhodotorula 
flava, Rhodotorula glutinis, Rhodotorula macerans, Rhodotorula minuta, Rhodotorula 
mucilaginosa, Rhodotorula pilimanae, Rhodotorula pustula, Rhodotorula rubra, 
30 Rhodotorula tokyoensis, Torulopsis colliculosa, Torulopsis dattila or Torulopsis 
neoformans; Cunninghamellaceae such as the genera Cunninghamella e.g. the 
species Cunninghamella blakesleeana, Cunninghamella echinulata, Cunninghamella 
echinulata var. elegans, Cunninghamella elegans or Cunninghamella homothallica; 
Demetiaceae such as the genera Alternaria. Bipolaris, Cercospora, Chalara, 
35 Cladosporium, Curvularia, Exophilia, Helicosporium, Helminthosporium, Orbimyces, 
Philalophora, Pithomyces, Spilocaea, Thielaviopsis, Wangiella e.g. the species 
Curvularia affinis, Curvularia clavata, Curvularia fallax, Curvularia inaequalis, 
Curvularia indica, Curvularia lunata, Curvularia pallescens, Curvularia verruculosa or 
Helminothosporium sp.; Moniliaceae such as the genera Arthrobotrys, Aspergillus, 
40 Epidermophyton, Geotrichum, Gliocladium, Histoplasma, Microsporum, Monilia, 
Oedocephalum, Oidium, Penicillium, Trichoderma, Trichophyton, Thrichoteclum, 
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Verticillium e.g. the species Aspergillus aculeatus, Aspergillus albus, Aspergillus 
alliaceus, Aspergillus asperescens , Aspergillus awamori, Aspergillus candidus, 
Aspergillus carbonarius, Aspergillus carneus, Aspergillus chevalieri, Aspergillus 
chevalieri var. intermedius, Aspergillus clavatus, Aspergillus ficuum, Aspergillus 
5 flavipes, Aspergillus flavus, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus 
giganteus, Aspergillus humicola, Aspergillus intermedius, Aspergillus japonicus, 
Aspergillus nidulans, Aspergillus niger, Aspergillus niveus, Aspergillus ochraceus, 
Aspergillus oryzae, Aspergillus ostianus, Aspergillus parasiticus, Aspergillus parasiticus 
var. globosus, Aspergillus penicillioides, Aspergillus phoenicis, Aspergillus rugulosus, 

1 0 Aspergillus sclerotiorum, Aspergillus sojae var. gymnosardae, Aspergillus sydowi, 
Aspergillus tamarii, Aspergillus terreus, Aspergillus terricola, Aspergillus toxicarius, 
Aspergillus unguis, Aspergillus ustus, Aspergillus versicolor, Aspergillus vitricolae, 
Aspergillus wentii,- Penicillium adametzi, -Penicillium albicans, Penicillium arabicum, 
Penicillium arenicola, Penicillium argillaceum, Penicillium arvense, Penicillium 

1 5 asperosporum, Penicillium aurantiogriseum, Penicillium avellaneum, -Penicillium 
baarnense, Penicillium bacillisporum, Penicillium brasilianum, Penicillium 
brevicompactum, Penicillium camemberti, -Penicillium canadense, Penicillium 
canescens, -Penicillium caperatum, -Penicillium capsulatum, -Penicillium 
caseicolum, -Penicillium chrysogenum, -Penicillium citreonigrum, Penicillium 

20 citrinum, -Penicillium clayiforme, -Penicillium commune, -Penicillium 

corylophilum,-Penicillium corymbiferum, Penicillium crustosum,-PeniciHium 
cyclopium, -Penicillium daleae, -Penicillium decumbens, -Penicillium dierckxii, -Penicillium 
digitatum, -Penicillium digitatum var. latum, -Penicillium divaricatum, -Penicillium 
diversum, -Penicillium duclauxii, -Penicillium echinosporum, -Penicillium 

25 expansum, -Penicillium fellutanum, Penicillium frequentans, -Penicillium 
funiculosum, -Penicillium glabrum, -Penicillium gladioli, -Penicillium 
griseofulvum, -Penicillium hirsutum,- Penicillium hispanicum, -Peniclllium 
islandicum, -Penicillium italicum, -Penicillium italicum var. avellaneum, -Penicillium 
janczewskii, -Penicillium janthinellum, -Penicillium japonicum, -Penicillium 

30 lavendulum, Penicillium lilacinum, -Penicillium lividum, -Penicillium martensii, Penicillium 
megasporum, -Penicillium miczynskii, -Penicillium nalgiovense, -Penicillium 
nigricans, -Penicillium notatum, Penicillium ochrochloron, Penicillium 
odoratum, -Penicillium oxalicum,- Penicillium paraherquei, - Penicillium 
patulum, -Penicillium pinophilum, -Penicillium piscarium, -Penicillium 

35 pseudostromaticum, -Penicillium puberulum, -Penicillium purpurogenum, -Penicillium 

raciborskii, -Penicillium roqueforti, -Penicillium rotundum, -Penicillium rubrum,-Penicillium 
sacculum, -Penicillium simplicissimum, Penicillium sp., Penicillium spinulosum, 
Penicillium steckii, Penicillium stoloniferum, Penicillium striatisporum, Penicillium 
striatum, Penicillium tardum, Penicillium thomii, Penicillium turbatum, Penicillium 

40 variabile, Penicillium vermiculatum, Penicillium vermoesenii, Penicillium verrucosum, 
Penicillium verrucosum var. corymbiferum, Penicillium verrucosum var. cyclopium, 
Penicillium verruculosum, Penicillium vinaceum, Penicillium violaceum, Penicillium 
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viridicatum, Penicillium vulpinum, Trichoderma hamatum, Trichoderma harzianum, 
Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma polysporum, 
Trichoderma reesei, Trichoderma virens or Trichoderma viride; Mortierellaceae such as 
the genera Mortierella e.g. the species Mortierelia isabellina, Mortierella polycephala, 
5 Mortierella ramanniana , Mortierella vinacea or Mortierella zonata; Mucoraceae such as 
the genera Actinomucor, Mucor, Phycomyces, Rhlzopus, Zygorhynchus e.g. the 
species Mucor amphibiorum, Mucor circinelloides f. circinelloides, Mucor circinelloides 
var. griseocyanus, Mucor flavus, Mucor fuscus, Mucor griseocyanus, Mucor 
heterosporus, Mucor hiemalis, Mucor hiemalis f. hiemalis, Mucor inaequisporus, Mucor 
1 0 indicus, Mucor javanicus, Mucor mucedo, Mucor mucilaginous, Mucor piriformis, Mucor 
plasmaticus, Mucor plumbeus, Mucor racemosus, Mucor racemosus f. racemosus, 
Mucor racemosusf. sphaerosporus, Mucor rouxianus, Mucor rouxii, Mucor sinensis, 
Mucor sp., Mucor spinosus, Mucor tuberculisporus, Mucor variisporus, Mucor 
variosporus, Mucor wosnessenskii, Phycomyces blakesleeanus, Rhizopus 
1 5 achlamydosporus, Rhizopus arrhizus, Rhizopus chinensis, Rhizopus delemar, 
Rhizopus formosaensis, Rhizopus japonicus, Rhizopus javanicus, Rhizopus 
microsporus, Rhizopus microsporus var. chinensis, Rhizopus microsporus var. 
oligosporus, Rhizopus microsporus var. rhizopodiformis, Rhizopus nigricans, Rhizopus 
niveus, Rhizopus oligosporus, Rhizopus oryzae, Rhizopus pygmaeus, Rhizopus 
20 rhizopodiformis, Rhizopus semarangensis, Rhizopus sontii, Rhizopus stolonifer, 
Rhizopus thermosus, Rhizopus tonkinensis, Rhizopus tritici or Rhizopus usamii; 
Pythiaceae such as the genera Phytium, Phytophthora e.g. the species Pythium 
debaryanum, Pythium intermedium, Pythium irregulare, Pythium megalacanthum, 
Pythium paroecandrum, Pythium sylvaticum, Pythium ultimum, Phytophthora cactorum, 
25 Phytophthora cinnamomi, Phytophthora citricola, Phytophthora citrophthora, 
Phytophthora cryptogea, Phytophthora drechsleri, Phytophthora erythroseptica, 
Phytophthora lateralis, Phytophthora megasperma, Phytophthora nicotianae, 
Phytophthora nicotianae var. parasitica, Phytophthora palmivora, Phytophthora 
parasitica or Phytophthora syringae; Saccharomycetaceae such as the genera 
30 Hansenula, Pichia, Saccharomyces, Saccharomycodes, Yarrowia e.g. the species 
Hansenula anomala, Hansenula californica, Hansenula canadensis, Hansenula 
capsulata, Hansenula ciferrii, Hansenula glucozyma, Hansenula henricii, Hansenula 
hotstii, Hansenula minuta, Hansenula nonfermentans, Hansenula philodendri, 
Hansenula polymorpha, Hansenula saturnus, Hansenula subpelliculosa, Hansenula 
35 wickerhamii, Hansenula wingei, Pichia alcoholophila, Pichia angusta, Pichia anomala, 
Pichia bispora, Pichia burtonii, Pichia canadensis, Pichia capsulata, Pichia carsonii, 
Pichia cellobiosa, Pichia ciferrii, Pichia farinosa, Pichia fermentans, Pichia finlandica, 
Pichia glucozyma, Pichia guilliermondii, Pichia haplophila, Pichia henricii, Pichia holstii, 
Pichia jadinii, Pichia lindnerii, Pichia membranaefaciens, Pichia methanolica, Pichia 
40 minuta var. minuta, Pichia minuta var. nonfermentans, Pichia norvegensis, Pichia 
ohmeri, Pichia pastoris, Pichia philodendri, Pichia pini, Pichia polymorpha, Pichia 
quercuum, Pichia rhodanensis, Pichia sargentensis, Pichia stipitis, Pichia 
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strasburgensis, Pichia subpelliculosa, Pichia toletana, Pichia trehalophila, Pichia vini, 
Pichia xylosa, Saccharomyces aceti, Saccharomyces bailii, Saccharomyces bayanus, 
Saccharomyces bisporus, Saccharomyces capensis, Saccharomyces carlsbergensis, 
Saccharomyces cerevisiae, Saccharomyces cerevisiae var. ellipsoideus, 
5 Saccharomyces chevalieri, Saccharomyces delbrueckii, Saccharomyces diastaticus, 
Saccharomyces drosophilarum, Saccharomyces elegans, Saccharomyces ellipsoideus, 
Saccharomyces fermentati, Saccharomyces florentinus, Saccharomyces fragffls, 
Saccharomyces heterogenous, Saccharomyces hienipiensis, Saccharomyces 
inusitatus, Saccharomyces italicus, Saccharomyces kluyveri, Saccharomyces krusei, 

1 0 Saccharomyces lactis, Saccharomyces marxianus, Saccharomyces microellipsoides, 
Saccharomyces montanus, Saccharomyces norbensis, Saccharomyces oleaceus, 
Saccharomyces paradoxus, Saccharomyces pastorianus, Saccharomyces pretoriensis, 
Saccharomyces rosei, Saccharomyces rouxii, Saccharomyces uvarum, 
Saccharomycodes ludwigii or Yarrowia lipolytica; Saprolegniaceae such as the genera 

15 Saprolegnia e.g. the species Saprolegnia ferax; Schizosaccharomycetaceae such as 
the genera Schizosaccharomyces e.g. the species Schizosaccharomyces japonicus 
van japonicus, Schizosaccharomyces japonicus var. versatilis, Schizosaccharomyces 
malidevorans, Schizosaccharomyces octosporus, Schizosaccharomyces pombe van 
malidevorans or Schizosaccharomyces pombe var. pombe; Sodariaceae such as the 

20 genera Neurospora, Sordaria e.g. the species Neurospora africana, Neurospora 
crassa, Neurospora intermedia, Neurospora sitophila, Neurospora tetrasperma, 
Sordaria fimicoia or Sordaria macrospora; Tuberculariaceae such as the genera 
Epicoccum, Fusarium, Myrothecium, Sphacelia, Starkeyomyces, Tubercularia e.g. the 
species Fusarium acuminatum, Fusarium anthophilum, Fusarium aquaeductuum, 

25 Fusarium aquaeductuum var. medium, Fusarium avenaceum, Fusarium buharicum, 
Fusarium camptoceras, Fusarium cerealis, Fusarium chlamydosporum, Fusarium 
ciliatum, Fusarium coccophilum, Fusarium coeruleum, Fusarium concolor, Fusarium 
crookwellense, Fusarium culmorum, Fusarium dimerum, Fusarium diversisporum, 
Fusarium equiseti, Fusarium equiseti var. bullatum, Fusarium eumartii, Fusarium 

30 flocciferum, Fusarium fujikuroi, Fusarium graminearum, Fusarium graminum, Fusarium 
heterosporum, Fusarium incarnatum, Fusarium inflexum, Fusarium javanicum, 
Fusarium lateritium, Fusarium lateritium var. majus, Fusarium longipes, Fusarium 
melanochlorum, Fusarium merismoides, Fusarium merismoides var. chlamydosporafe, 
Fusarium moniliforme, Fusarium moniliforme var. anthophilum, Fusarium moniliforme 

35 var. subglutinans, Fusarium nivale, Fusarium nivale var. majus, Fusarium oxysporum, 
Fusarium oxysporum f. sp. aechmeae, Fusarium oxysporum f. sp. cepae, Fusarium 
oxysporum f. sp. conglutinans, Fusarium oxysporum f. sp. cucumerinum, Fusarium 
oxysporum f. sp. cyclaminis, Fusarium oxysporum f. sp. dianthi, Fusarium oxysporum f. 
sp. lycopersici, Fusarium oxysporum f. sp. melonis, Fusarium oxysporum f. sp. 

40 passiflorae, Fusarium oxysporum f. sp. pisi, Fusarium oxysporum f. sp. tracheiphilum, 
Fusarium oxysporum f. sp. tuberosi, Fusarium oxysporum f. sp. tulipae, Fusarium 
oxysporum f. sp. vasinfectum, Fusarium pallidoroseum, Fusarium poae, Fusarium 
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proliferatum, Fusarium proliferatum var. minus, Fusarium redolens, Fusarium redolens 
f. sp. dianthi, Fusarium reticulatum, Fusarium roseum, Fusarium saccharivar. 
elongatum, Fusarium sambucinum, Fusarium sambucinum var. coeruleum, Fusarium 
semitectum, Fusarium semitectum var. majus, Fusarium solani, Fusarium solanif. sp. 
5 pisi, Fusarium sporotrichioides, Fusarium sporotrichioides var. minus, Fusarium 

subiunatum, Fusarium succisae, Fusarium sulphureum, Fusarium tabacinum, Fusarium 
tricinctum, Fusarium udum, Fusarium ventricosum, Fusarium verticillioides, Fusarium 
xylarioides or Fusarium zonatum; Sporobolomycetaceae such as the genera Bullera, 
Sporobolomyces, Itersonilia e.g. the species Sporobolomyces holsaticus, 
1 0 Sporobolomyces odorus, Sporobolomyces puniceus, Sporobolomyces salmonicolor, 
Sporobolomyces singularisor-Sporobolomyces tsugae; Adelotheciaceae such as the 
genera e.g. the species Physcomitrella patens; Dinophyceae such as the genera 
Crypthecodinium, Phaeodactylum e.g. the species Crypthecodinium cohniior 
Phaeodactylum tricornutum; Ditrichaceae such as the genera Ceratodon, Pleuridium, 
15 Astomiopsis, Ditrichum, Philibertiella, Ceratodon, Distichium, Skottsbergia e.g. the 
species Ceratodon antarcticus, Ceratodon purpureus, Ceratodon purpureus ssp. 
convolutes or Ceratodon purpureus ssp. stenocarpus; Prasinophyceae such as the 
genera Nephroselmis, Prasinococcus, Scherffelia, Tetraselmis, Mantoniella, 
Ostreococcus e.g. the species Nephroselmis olivacea, Prasinococcus capsulatus, 
20 Scherffelia dubia, Tetraselmis chui, Tetraselmis suecica, Mantoniella squamata or 
Ostreococcus tauri; Actinomycetaceae such as the genera Actinomyces, 
Actinobaculum, Arcanobacterium, Mobiluncus e.g. the species Actinomyces 
bemardiae, Actinomyces bovis, Actinomyces bowdenii, Actinomyces canis, 
Actinomyces cardiffensis, Actinomyces catuli, Actinomyces coleocanis, Actinomyces 
25 denticolens, Actinomyces europaeus, Actinomyces funkei, Actinomyces georgiae, 
Actinomyces gerencseriae, Actinomyces hordeovulneris, Actinomyces howellii, 
Actinomyces humiferus, Actinomyces hyovaginalis, Actinomyces israelii, Actinomyces 
marimammalium, Actinomyces meyeri, Actinomyces naeslundii, Actinomyces nasicola, 
Actinomyces neuii subsp. anitratus, Actinomyces neuii subsp. neuii, Actinomyces 
30 odontolyticus, Actinomyces oricola, Actinomyces pyogenes, Actinomyces radicidentis, 
Actinomyces radingae, Actinomyces slackii, Actinomyces suimastitidis, Actinomyces 
suis, Actinomyces turicensis, Actinomyces urogenitalis, Actinomyces vaccimaxillae, 
Actinomyces viscosus, Actinobaculum schaalii, Actinobaculum suis, Actinobaculum 
urinate, Arcanobacterium bemardiae, Arcanobacterium haemolyticum, 
35 Arcanobacterium hippocoleae, Arcanobacterium phocae, Arcanobacterium 
pluranimalium, Arcanobacterium pyogenes, Mobiluncus curtisii subsp. curtisii, 
Mobiluncus curtisii subsp. holmesii or Mobiluncus mulieris; Bacillaceae such as the 
genera Amphibacillus, Anoxybacillus, Bacillus, Exiguobacterium, Gracilibacillus, 
Holobacillus, Saccharococcus, Salibacillus, Virgibacillus e.g. the species Amphibacillus 
40 fermentum, Amphibacillus tropicus, Amphibacillus xylanus, Anoxybacillus flavithermus, 
Anoxybacillus gonensis, Anoxybacillus pushchinoensis, Bacillus acidocaldarius, 
Bacillus acidoterrestris, Bacillus aeolius, Bacillus agaradhaerens, Bacillus agri, Bacillus 
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alcalophilus, Bacillus alginolyticus, Bacillus alvei, Bacillus amyloliquefaciens, Bacillus 
amylolyticus, Bacillus aneurinilyticus, Bacillus aquimaris, Bacillus arseniciselenatis, 
Bacillus atrophaeus, Bacillus azbtofixans, Bacillus azotoformans, Bacillus badius, 
Bacillus barbaricus, Bacillus benzoevorahs, Bacillus borstelensis, Bacillus brevis, 
5 Bacillus carboniphilus, Bacillus centrosporus, Bacillus cereus, Bacillus chitinolyticus, 
Bacillus chondroitinus, Bacillus choshinensis, Bacillus circulans, Bacillus clarkii, 
Bacillus clausii, Bacillus coagulans, Bacillus cohnii, Bacillus curdlanolyticus, Bacillus 
cycloheptanicus, Bacillus decolorationis, Bacillus dipsosauri, Bacillus edaphicus, 
Bacillus ehimensis, Bacillus endophyticus, Bacillus fastidiosus, Bacillus firmus, Bacillus 

10 flexus, Bacillus formosus, Bacillus fumarioli, Bacillus funiculus, Bacillus fusiformis, 
Bacillus sphaericus subsp. fusiformis, Bacillus galactophilus, Bacillus globisporus, 
Bacillus globisporus subsp. marinus, Bacillus glucanolyticus, Bacillus gordonae, 
Bacillus halmapalus, Bacillus haloalkaliphilus, Bacillus halodenitrificans, Bacillus 
halodurans, Bacillus halophilus, Bacillus horikoshii, Bacillus horti, Bacillus infernos, 

15 Bacillus insolitus, Bacillus jeotgali, Bacillus kaustophilus, Bacillus kobensis, Bacillus 
krulwichiae, Bacillus laevolacticus, Bacillus Jarvae, Bacillus laterosporus, Bacillus 
lautus, Bacillus lentimorbus, Bacillus lentus, Bacillus licheniformis, Bacillus luciferensis, 
Bacillus macerans, Bacillus macquariensis, Bacillus marinus, Bacillus marisflavi, 
Bacillus marismortui, Bacillus megaterium, Bacillus methanolicus, Bacillus migulanus, 

20 Bacillus mojavensis, Bacillus mucilaginosus, Bacillus mycoides, Bacillus naganoensis, 
Bacillus nealsonii, Bacillus neidei, Bacillus niacini, Bacillus okuhidensis, Bacillus 
oleronius, Bacillus pabuli, Bacillus pallidus, Bacillus pantothenticus, Bacillus 
parabrevis, Bacillus pasteurii, Bacillus peoriae, Bacillus polymyxa, Bacillus popilliae, 
Bacillus pseudalcaliphilus, Bacillus pseudofirmus, Bacillus pseudomycoides, Bacillus 

25 psychrodurans, Bacillus psychrophilus, Bacillus psychrosaccharolyticus, Bacillus 
psychrotolerans, Bacillus pulvifaciens, Bacillus pumilus, Bacillus pycnus, Bacillus 
reuszeri, Bacillus salexigens, Bacillus schlegelii, Bacillus selenitireducens, Bacillus 
silvestris, Bacillus simplex, Bacillus siralis, Bacillus sm 'rthii, Bacillus sonorensis, Bacillus 
sphaericus, Bacillus sporothermodurans, Bacillus stearothermophilus , Bacillus 

30 subterraneus, Bacillus subtilis subsp. spizizenii, Bacillus subtilis subsp. subtilis, Bacillus 
thermantarcticus, Bacillus thermoaerophilus, Bacillus thermoamylovorans, Bacillus 
thermoantarcticus, Bacillus thermocatenulatus, Bacillus thermocloacae, Bacillus 
thermodenitrificans, Bacillus thermoglucosidasius, Bacillus thermoleovorans, Bacillus 
thermoruber, Bacillus thermosphaericus, Bacillus thiaminolyticus, Bacillus 

35 thuringiensis, Bacillus tusciae, Bacillus validus, Bacillus vallismortis, Bacillus vedderi, 
Bacillus vulcani, Bacillus weihenstephanensis, Exiguobacterium acetylicum, 
Exiguobacterium antarcticum, Exiguobacterium aurantiacum, Exiguobacterium undae, 
Gracilibacillus dipsosauri, Gracilibacillus halotolerans, Halobacillus halophilus, 
Halobacillus karajensis, Halobacillus litoralis, Halobacillus salinus, Halobacillus 

40 trueperi, Saccharococcus caldoxylosilyticus, Saccharococcus thermophilus, 
Salibacillus marismortui, Salibacillus salexigens, Virgibacillus carmonensis, 
Virgibacillus marismortui, Virgibacillus necropolis, Virgibacillus pantothenticus, 
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Virgibacillus picturae, Virgibacillus proomii or Virgibacillus salexigens, 
Brevibacteriaceae such as the genera Brevibacterium e.g. the species Brevibacterium 
acetylicum, Brevibacterium albidum, Brevibacterium ammoniagenes, Brevibacterium 
avium, Brevibacterium casei, Brevibacterium citreum, Brevibacterium divaricatum, 
5 Brevibacterium epidermidis, Brevibacterium fermentans, Brevibacterium frigoritolerans, 
Brevibacterium haiotolerans, Brevibacterium imperiale, Brevibacterium incertum, 
Brevibacterium iodinum, Brevibacterium linens, Brevibacterium liquefaciens, 
Brevibacterium lutescens, Brevibacterium luteum, Brevibacterium lyticum, 
Brevibacterium mcbrellneri, Brevibacterium otitidis, Brevibacterium oxydans, 
1 0 Brevibacterium paucivorans, Brevibacterium protophormiae, Brevibacterium pusillum, 
- Brevibacterium saperdae, Brevibacterium stationis, Brevibacterium testaceum or 
Brevibacterium vitaeruminis; Corynebacteriaceae such as the genera Corynebacterium 
e.g. the species Corynebacterium accolens, Corynebacterium afermentans subsp. 
afermentans, Corynebacterium afermentans subsp. lipophilum, Corynebacterium 
15 ammoniagenes, Corynebacterium amycolatum, Corynebacterium appendicis, 
Corynebacterium aquilae, Corynebacterium argentoratense, Corynebacterium 
atypicum, Corynebacterium aurimucosum, Corynebacterium auris, Corynebacterium 
auriscanis, Corynebacterium betae, Corynebacterium beticola, Corynebacterium bovis, 
Corynebacterium callunae, Corynebacterium camporealensis, Corynebacterium 
20 capitovis, Corynebacterium casei, Corynebacterium confusum, Corynebacterium 
coyleae, Corynebacterium cystitidis, Corynebacterium durum, Corynebacterium 
efficiens, Corynebacterium equi, Corynebacterium falsenii, Corynebacterium fascians, 
Corynebacterium felinum, Corynebacterium flaccumfaciens, Corynebacterium 
flavescens, Corynebacterium freneyi, Corynebacterium glaucum, Corynebacterium 
25 giucuronolyticum, Corynebacterium glutamicum, Corynebacterium hoagii, 

Corynebacterium ilia's, Corynebacterium imitans, Corynebacterium insidiosum, 
Corynebacterium iranicum, Corynebacterium jeikeium, Corynebacterium 
kroppenstedtii, Corynebacterium kutscheri, Corynebacterium lilium, Corynebacterium 
lipophiloflavum, Corynebacterium macginleyi, Corynebacterium mastitidis, 
30 Corynebacterium matruchotii, Corynebacterium michiganense, Corynebacterium 

michiganense subsp. tessellarius, Corynebacterium minutissimum, Corynebacterium 
mooreparkense, Corynebacterium mucifaciens, Corynebacterium mycetoides, 
Corynebacterium nebraskense, Corynebacterium oortii, Corynebacterium 
paurometabolum, Corynebacterium phocae, Corynebacterium pilosum, 
35 Corynebacterium poinsettiae, Corynebacterium propinquum, Corynebacterium 
pseudodiphtheriticum, Corynebacterium pseudotuberculosis, Corynebacterium 
pyogenes, Corynebacterium rathayi, Corynebacterium renale, Corynebacterium riegelii, 
Corynebacterium seminale, Corynebacterium sepedonicum, Corynebacterium 
simulans, Corynebacterium singulare, Corynebacterium sphenisci, Corynebacterium 
40 spheniscorum, Corynebacterium striatum, Corynebacterium suicordis, 

Corynebacterium sundsvallense, Corynebacterium terpenotabidum, Corynebacterium 
testudinoris, Corynebacterium thomssenii, Corynebacterium tritici, Corynebacterium 
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ulcerans, Corynebacterium urealyticum, Corynebacterium variabile, Corynebacterium 
vitaeruminis or Corynebacterium xerosis; Enterobacteriacae such as the genera 
Alterococcus, Arsenophonus, Brenneria, Buchnera, Budvicia, Buttiauxella, 
Calymmatobacterium, Cedecea, Citrobacter, Edwardsiella, Enterobacter, Erwinia, 
5 Escherichia, Ewingella, Hafnia, Klebsiella, Kluyvera, Leclercia, Leminorella, 

Moellerella, Morganella, Obesumbacterium, Pantoea, Pectobacterium, Photorhabdus, 
Plesiomonas, Pragia, Proteus, Providencia, Rahnella, Saccharobacter, Salmonella, 
Shigella, Serratia, Sodalis, Tatumella, Trabulsiella, Wigglesworthia, 
Xenorhabd us, Yersinia and Yokenella e.g. the species Arsenophonus nasoniae, 

10 Brenneria alni, Brenneria nigrifluens, Brenneria quercina, Brenneria rubrifaciens, 
Brenneria salicis, Budvicia aquatica, Buttiauxella agrestis, Buttiauxella brennerae, 
Buttiauxella ferragutiae, Buttiauxella gaviniae, Buttiauxella izardii, Buttiauxella 
noackiae, Buttiauxella warmboldiae, Cedecea davisae, Cedecea lapagei, Cedecea 
neteri, Citrobacter amalonaticus, Citrobacter diversus, Citrobacter freundii, Citrobacter 

15 genomospecies, Citrobacter gillenii, Citrobacter intermedium, Citrobacter koseri, 
Citrobacter muriiniae, Citrobacter sp., Edwardsiella hoshinae, Edwardsiella ictaluri, 
Edwardsiella tarda, Erwinia alni, Erwinia amylovora, Erwinia ananatis, Erwinia 
aphidicola, Erwinia billingiae, Erwinia cacticida, Erwinia cancerogena, Erwinia 
carnegieana, Erwinia carotovora subsp. atroseptica, Erwinia carotovora subsp. 

20 betavasculorum, EnA/inia carotovora subsp. odorifera, Erwinia carotovora subsp. 
wasabiae, Erwinia chrysanthemi, Erwinia cypripedii, Erwinia dissolvens, Erwinia 
herbicola, Erwinia mallotivora, Erwinia milletiae, Erwinia nigrifluens, Env/n/a 
nimipressuralis, Erwinia persicina, Erwinia psidii, Erwinia pyrifoliae, Erwinia quercina, 
Erwinia rhapontici, EnA/inia rubrifaciens, Erwinia salicis, Erwinia stewartii, Erwinia 

25 tracheiphila, Erwinia uredovora, Escherichia adecarboxylata, Escherichia anindolica, 
Escherichia aurescens, Escherichia blattae, Escherichia coli, Escherichia coli van 
communion Escherichia colhmutabite, Escherichia fergusonii, Escherichia hermannii, 
Escherichia sp., Escherichia vulneris, Ewingella americana, Hafnia alvei, Klebsiella 
aerogenes, Klebsiella edwardsii subsp. atlantae, Klebsiella omithinolytica, Klebsiella 

30 oxytoca, Klebsiella planticola, Klebsiella pneumoniae, Klebsiella pneumoniae subsp. 
pneumoniae, Klebsiella sp., Klebsiella terrigena, Klebsiella trevisanii, Kluyvera 
ascorbata, Kluyvera citrophila, Kluyvera cochleae, Kluyvera cryocrescens, Kluyvera 
georgiana, Kluyvera noncitrophila, Kluyvera sp., Leclercia adecarboxylata, Leminorella 
grimontii, Leminorella richardii, Moellerella wisconsensis, Morganella morganii, 

35 Morganella morganii subsp. morganii, Morganella morganii subsp. sibonii, 

Obesumbaterium proteus, Pantoea agglomerans, Pantoea ananatis, Pantoea citrea, 
Pantoea dispersa, Pantoea punctata, Pantoea stewartii subsp. stewartii, Pantoea 
terrea, Pectobacterium atrosepticum, Pectobacterium carotovorum subsp. 
atrosepticum, Pectobacterium carotovorum subsp. carotovorum, Pectobacterium 

40 chrysanthemi, Pectobacterium cypripedii, Photorhabdus asymbiotica, Photorhabdus 
luminescens, Photorhabdus luminescens subsp. akhurstii, Photorhabdus luminescens 
subsp. laumondii, Photorhabdus luminescens subsp. luminescens, Photorhabdus sp., 
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Photorhabdus temperata, Plesiomonas shigelloides, Pragia fontium, Proteus hauseri, 
Proteus ichthyosmius, Proteus inconstans, Proteus mirabilis, Proteus morganii, Proteus 
myxofaciens, Proteus penneri, Proteus rettgeri, Proteus shigelloides, Proteus vulgaris, 
Providencia alcalifaciens, Providencia friedericiana, Providencia heimbachae, 
5 Providencia rettgeri, Providencia rustigianii, Providencia stuartii, Rahnella aquatilis, 
Salmonella abony, Salmonella arizonae, Salmonella bongori, Salmonella choleraesuis 
subsp. arizonae, Salmonella choleraesuis subsp. bongori, Salmonella choleraesuis 
subsp. cholereasuis, Salmonella choleraesuis subsp. diarizonae, Salmonella 
choleraesuis subsp. houtenae, Salmonella choleraesuis subsp. indica, Salmonella 
10 choleraesuis subsp. salamae, Salmonella daressalaam, Salmonella enterica subsp. 
- houtenae, Salmonella enterica subsp. salamae, Salmonella enteritidis, Salmonella 
gallinarum, Salmonella heidelberg, Salmonella panama, Salmonella senftenberg, 
Salmonella typhimurium, Serratia entomophila, Serratia ficaria, Serratia fonticola, 
Serratia grimesii, Serratia liquefaciens, Serratia marcescens, Serratia marcescens 
15 subsp. marcescens, Serratia marinorubra, Serratia odorifera, Serratia plymouthensis, 
Serratia plymuthica, Serratia proteamaculans, Serratia proteamaculans subsp. 
quinovora, Serratia quinivorans, Serratia rubidaea, Shigella boydii, Shigella flexneri, 
Shigella paradysenteriae, Shigella sonnei, Tatumella ptyseos, Xenorhabdus beddingii, 
Xenorhabdus bovienii, Xenorhabdus luminescens, Xenorhabdus nematophila, 
20 Xenorhabdus nematophila subsp. beddingii, Xenorhabdus nematophila subsp. bovienii, 
Xenorhabdus nematophila subsp. poinarii or Xenorhabdus poinarii; Gordoniaceae such 
as the genera Gordonia, Skermania e.g. the species Gordonia aichiensis, Gordonia 
alkanivorans, Gordonia amarae, Gordonia amicalis, Gordonia bronchialis, Gordonia 
desulfuricans, Gordonia hirsuta, Gordonia hydrophobica, Gordonia namibiensis, 
25 Gordonia nitida, Gordonia paraffinivorans, Gordonia polyisoprenivorans, Gordonia 
rhizosphera, Gordonia rubripertincta, Gordonia sihwensis, Gordonia sinesedis, 
Gordonia sputi, Gordonia terrae or Gordonia westfalica; Micrococcaceae such as the 
genera Micrococcus, Arthrobacter, Kocuria, Nesterenkonia, Renibacterium, Rothia, 
Stomatococcus e.g. the species Micrococcus agilis, Micrococcus antarcticus, 
30 Micrococcus halobius, Micrococcus kristinae, Micrococcus luteus, Micrococcus lylae, 
Micrococcus nishinomiyaensis, Micrococcus roseus, Micrococcus sedentarius, 
Micrococcus varians, Arthrobacter agilis, Arthrobacter albus, Arthrobacter atrocyaneus, 
Arthrobacter aurescens, Arthrobacter chlorophenolicus, Arthrobacter citreus, 
Arthrobacter creatinolyticus, Arthrobacter crystallopoietes, Arthrobacter cumminsii, 
35 Arthrobacter duodecadis, Arthrobacter flavescens, Arthrobacter flavus, Arthrobacter 
gandavensis, Arthrobacter globiformis, Arthrobacter histidinolovorans, Arthrobacter 
ilicis, Arthrobacter koreensis, Arthrobacter luteolus, Arthrobacter methylotrophus, 
Arthrobacter mysorens, Arthrobacter nasiphocae, Arthrobacter nicotianae, Arthrobacter 
nicotinovorans, Arthrobacter oxydans, Arthrobacter pascens, Arthrobacter 
40 picolinophilus, Arthrobacter polychromogenes, Arthrobacter protophormiae, 

Arthrobacter psychrolactophilus, Arthrobacter radiotolerans, Arthrobacter ramosus, 
Arthrobacter rhombi, Arthrobacter roseus, Arthrobacter siderocapsulatus, Arthrobacter 
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simplex, Arthrobacter suffonivorans, Arthrobacter sulfureus, Arthrobacter terregens, 
Arthrobacter tumescens, Arthrobacter uratoxydans, Arthrobacter ureafaciens, 
Arthrobacter variabilis, Arthrobacter viscosus, Arthrobacter woluwensis, Kocuria 
erythromyxa, Kocuria kristinae, Kocuria palustris, Kocuria polaris, Kocuria rhizophila, 
5 Kocuria rosea, Kocuria varians, Nesterenkonia halobia, Nesterenkonia lacusekhoensis, 
Renibacterium salmoninarum, Rothia amarae, Rothia dentocariosa, Rothia 
mucilaginosa, Rothia nasimurium or Stomatococcus mucilaginosus; Mycobacteriaceae 
such as the genera Mycobacterium e.g. the species Mycobacterium africanum, 
Mycobacterium agri, Mycobacterium aichiense, Mycobacterium alvei, Mycobacterium 

10 asiaticum, Mycobacterium aurum, Mycobacterium austroafricanum, Mycobacterium 
bohemicum, Mycobacterium botniense, Mycobacterium brumae, Mycobacterium 
chelonae subsp. abscessus, Mycobacterium ch'rtae, Mycobacterium chlorophenolicum, 
Mycobacterium chubuense, Mycobacterium confluentis, Mycobacterium cookii, 
Mycobacterium diernhoferi, Mycobacterium doricum, Mycobacterium duvalii, 

15 Mycobacterium fallax, Mycobacterium farcinogenes, Mycobacterium flavescens, 
Mycobacterium frederiksbergense, Mycobacterium gadium, Mycobacterium gilvum, 
Mycobacterium gordonae, Mycobacterium hassiacum, Mycobacterium hibemiae, 
Mycobacterium hodleri, Mycobacterium holsaticum, Mycobacterium komossense, 
Mycobacterium lacus, Mycobacterium madagascariense, Mycobacterium mageritense, 

20 Mycobacterium montefiorense, Mycobacterium moriokaense, Mycobacterium murale, 
Mycobacterium neoaurum, Mycobacterium nonchromogenicum, Mycobacterium 
obuense, Mycobacterium palustre, Mycobacterium parafortuitum, Mycobacterium 
peregrinum, Mycobacterium phlei, Mycobacterium pinnipedii, Mycobacterium poriferae, 
Mycobacterium pulveris, Mycobacterium rhodesiae, Mycobacterium shottsii, 

25 Mycobacterium sphagni, Mycobacterium terrae, Mycobacterium thermoresistibile, 
Mycobacterium tokaiense, Mycobacterium triviale, Mycobacterium tusciae or 
Mycobacterium vanbaalenii; Nocardiaceae such as the genera Nocardia, Rhodococcus 
e.g. the species Nocardia abscessus, Nocardia africana, Nocardia amarae, Nocardia 
asteroides, Nocardia autotrophica, Nocardia beijingensis, Nocardia brasiliensis, 

30 Nocardia brevicatena, Nocardia caishijiensis, Nocardia calcarea, Nocardia camea, 
Nocardia cellulans, Nocardia cerradoensis, Nocardia coeliaca, Nocardia 
corynebacterioides, Nocardia crassostreae, Nocardia cummidelens, Nocardia 
cyriacigeorgica, Nocardia farcinica, Nocardia flavorosea, Nocardia fluminea, Nocardia 
globerula, Nocardia hydrocarbonoxydans, Nocardia ignorata, Nocardia mediterranei, 

35 Nocardia nova, Nocardia orientalis, Nocardia otitidis-caviarum, Nocardia 

otitidiscaviarum, Nocardia paucivorans, Nocardia petroleophila, Nocardia pinensis, 
Nocardia pseudobrasiliensis, Nocardia pseudovaccinii, Nocardia puris, Nocardia 
restricta, Nocardia rugosa, Nocardia salmonicida, Nocardia saturnea, Nocardia 
seriolae, Nocardia soli, Nocardia sulphurea, Nocardia transvalensis, Nocardia 

40 uniformis, Nocardia vaccinii, Nocardia veterana or Nocardia vinacea; 

Pseudomonaceae such as the genera Azomonas, Azotobacter, Cellvibrio, 
Chryseomonas, Flaviomonas, Lampropedia, Mesophilobacter, Morococcus, Oligella, 
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Pseudomonas, Rhizobacter, Rugamonas, Serpens, Thermoleophilum, Xylophilus e.g. 
the species Azomonas agilis, Azomonas insignis, Azomonas macrocytogenes, 
Azotobacter agilis, Azotobacter agilis subsp. armeniae, Azotobacter armeniacus, 
Azotobacter beijerinckii, Azotobacter chroococcum, Azotobacter indicum, Azotobacter 
5 macrocytogenes, Azotobacter miscellum, Azotobacter nigricans subsp. nigricans, 
/iwnM a rn« n9 i< Avntnhartar stftlinefstris. Azotobacter so.. Azotobacter vinelandii, 

/^i.WlV/*/UV/l^i jLruw*jMr«*«*« f m *— * • • 

Flavimonas oryzihabitans, Mesophilobacter marinus, Oligella urethralis, Pseudomonas 
acidovorans, Pseudomonas aeruginosa, Pseudomonas agarici, Pseudomonas 
alcaligenes, Pseudomonas aminovorans, Pseudomonas amygdali, Pseudomonas 
10 andropogonis, Pseudomonas anguilliseptica, Pseudomonas antarctica, Pseudomonas 
■ antimicrobica, Pseudomonas antimycetica, Pseudomonas aptata, Pseudomonas 
arvilla, Pseudomonas asplenii, Pseudomonas atlantica, Pseudomonas atrofaciens, 
Pseudomonas aureofaciens, Pseudomonas avellanae, Pseudomonas azelaica, 
Pseudomonas azotocolligans, Pseudomonas balearica, Pseudomonas barkeri, 
15 Pseudomonas bathycetes, Pseudomonas beijerinckii, Pseudomonas brassicacearum, 
Pseudomonas brenneri, Pseudomonas butanovora, Pseudomonas carboxydoflava, 
Pseudomonas carboxydohydrogena, Pseudomonas carboxydovorans, Pseudomonas 
carrageenovora, Pseudomonas caryophylli, Pseudomonas cepacia, Pseudomonas 
chloritidismutans, Pseudomonas chlororaphis, Pseudomonas cichorii, Pseudomonas 
20 citronellolis, Pseudomonas cocovenenans, Pseudomonas compransoris, 

Pseudomonas congelans, Pseudomonas coronafaciens, Pseudomonas corrugata, 
Pseudomonas dacunhae, Pseudomonas delafieldii, Pseudomonas delphinii, 
Pseudomonas denitrificans, Pseudomonas desmolytica, Pseudomonas diminuta, 
Pseudomonas doudoroffii, Pseudomonas echinoides, Pseudomonas elongata, 
25 Pseudomonas extorquens, Pseudomonas extremorientalis, Pseudomonas facilis, 
Pseudomonas ficuserectae, Pseudomonas flava, Pseudomonas flavescens, 
Pseudomonas fluorescens, Pseudomonas fragi, Pseudomonas frederiksbergensis, 
Pseudomonas fulgida, Pseudomonas fuscovaginae, Pseudomonas gazotropha, 
Pseudomonas gladioli, Pseudomonas glathei, Pseudomonas glumae, Pseudomonas 
30 graminis, Pseudomonas halophila, Pseudomonas helianthi, Pseudomonas huttiensis, 
Pseudomonas hydrogenothermophila, Pseudomonas hydrogenovora, Pseudomonas 
indica, Pseudomonas indigofera, Pseudomonas iodinum, Pseudomonas kilonensis, 
Pseudomonas lachrymans, Pseudomonas lapsa, Pseudomonas lemoignei, 
Pseudomonas lemonnieri, Pseudomonas lundensis, Pseudomonas luteola, 
35 Pseudomonas maltophilia, Pseudomonas marginalis, Pseudomonas marginata, 
Pseudomonas marina, Pseudomonas meliae, Pseudomonas mendocina, 
Pseudomonas mesophilica, Pseudomonas mixta, Pseudomonas monteilii, 
Pseudomonas morsprunorum, Pseudomonas multivorans, Pseudomonas natriegens, 
Pseudomonas nautica, Pseudomonas nitroreducens, Pseudomonas oleovorans, 
40 Pseudomonas oryzihabitans, Pseudomonas ovalis, Pseudomonas oxalaticus, 

Pseudomonas palleronii, Pseudomonas paucimobilis, Pseudomonas phaseolicola, 
Pseudomonas phenazinium, Pseudomonas pickettii, Pseudomonas pisi, Pseudomonas 
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plantarii, Pseudomonas plecoglossicida, Pseudomonas poae, Pseudomonas primulae, 
Pseudomonas proteolytica, Pseudomonas pseudoalcaligenes, Pseudomonas 
pseudoalcaligenes subsp. konjaci, Pseudomonas pseudoalcaligenes subsp. 
pseudoalcaligenes, Pseudomonas pseudoflava, Pseudomonas putida, Pseudomonas 
5 putida var. naraensis, Pseudomonas putrefaciens, Pseudomonas pyrrocinia, 
Pseudomonas radiora, Pseudomonas reptilivora, Pseudomonas rhodesiae, 
Pseudomonas rhodos, Pseudomonas riboflavina, Pseudomonas rubescens, 
Pseudomonas rubrisubalbicans, Pseudomonas njhlandii, Pseudomonas saccharophila, 
Pseudomonas savastanoi, Pseudomonas savastanoi pvar. glycinea, Pseudomonas 

10 savastanoi pvar. phaseolicola, Pseudomonas solanacearum, Pseudomonas sp. f 

- Pseudomonas spinosa, Pseudomonas stanieri, Pseudomonas stutzeri, Pseudomonas 
syringae, Pseudomonas syringae pvar. aptata, Pseudomonas syringae pvar. 
atrofaciens, Pseudomonas syringae pvar. coronafaciens, Pseudomonas syringae pvar. 
delphinii, Pseudomonas syringae pvar. glycinea, Pseudomonas syringae pvar. 

15 helianthi, Pseudomonas syringae pvar. lachrymans, Pseudomonas syringae pvar. 
lapsa, Pseudomonas syringae pvar. morsprunorum, Pseudomonas syringae pvar. 
phaseolicola, Pseudomonas syringae pvar. primulae, Pseudomonas syringae pvar. 
syringae, Pseudomonas syringae pvar. tabaci, Pseudomonas syringae pvar. tomato, 
Pseudomonas syringae subsp. glycinea, Pseudomonas syringae subsp. savastanoi, 

20 Pseudomonas syringae subsp. syringae, Pseudomonas syzygii, Pseudomonas tabaci, 
Pseudomonas taeniospiralis, Pseudomonas testosteroni, Pseudomonas 
thermocarboxydovorans, Pseudomonas thermotolerans, Pseudomonas thivervalensis, 
Pseudomonas tomato, Pseudomonas trivialis, Pseudomonas veronii, Pseudomonas 
vesicularis, Pseudomonas viridiflava, Pseudomonas viscogena, Pseudomonas woodsil, 

25 Rhizobacter dauci, Rhizobacter daucus orXylophilus ampelinus; Rhizobiaceae such as 
the genera Agrobacterium, Carbophilus, Chelatobacter, Ensifer, Rhizobium, 
Sinorhizobium e.g. the species Agrobacterium atlanticum, Agrobacterium ferrugineum, 
Agrobacterium gelatinovorum, Agrobacterium larrymoorei, Agrobacterium meteori, 
Agrobacterium radiobacter, Agrobacterium rhizogenes, Agrobacterium rubi, 

30 Agrobacterium stellulatum, Agrobacterium tumefaciens, Agrobacterium vitis, 

Carbophilus carboxidus, Chelatobacter heintzii, Ensifer adhaerens, Ensifer arboris, 
Ensifer fredii, Ensifer kostiensis, Ensifer kummerowiae, Ensifer medicae, Ensifer 
meliloti, Ensifer saheli, Ensifer terangae, Ensifer xinjiangensis, Rhizobium ciceri 
Rhizobium etli, Rhizobium fredii, Rhizobium galegae, Rhizobium gallicum, Rhizobium 

35 giardinii, Rhizobium hainanense, Rhizobium huakuii, Rhizobium huautlense, Rhizobium 
indigoferae, Rhizobium japonicum, Rhizobium leguminosarum, Rhizobium loessense, 
Rhizobium loti, Rhizobium lupini, Rhizobium mediterraneum, Rhizobium meliloti, 
Rhizobium mongolense, Rhizobium phaseoli, Rhizobium radiobacter, Rhizobium 
rhizogenes, Rhizobium rubi, Rhizobium sullae, Rhizobium tianshanense, Rhizobium 

40 trifolii, Rhizobium tropici, Rhizobium undicola, Rhizobium vitis, Sinorhizobium 

adhaerens, Sinorhizobium arboris, Sinorhizobium fredii, Sinorhizobium kostiense, 
Sinorhizobium kummerowiae, Sinorhizobium medicae, Sinorhizobium meliloti, 
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Sinorhizobium morelense, Sinorhizobium saheli or Sinorhizobium xinjiangense; 
Streptomycetaceae such as the genera Kitasatosprora, Streptomyces, 
Streptoverticillium e.g. the species Streptomyces abikoensis, Streptomyces 
aburaviensis, Streptomyces achromogenes subsp. achromogenes, Streptomyces 
5 achromogenes subsp. rubradiris, Streptomyces acidiscabies, Streptomyces acrimycint, 
Streptomyces aculeolatus, Streptomyces afghaniensis, Sirepiomyces aianosinicus, 
Streptomyces albaduncus, Streptomyces albiaxialis, Streptomyces 
albidochromogenes, Streptomyces albidoflavus, Streptomyces albireticuli, 
Streptomyces albofaciens, Streptomyces alboflavus, Streptomyces albogriseolus, 
10 Streptomyces albolongus, Streptomyces alboniger, Streptomyces albospinus, 

Streptomyces albosporeus subsp. albosporeus, Streptomyces albosporeus subsp. 
labilomyceticus, Streptomyces alboverticillatus, Streptomyces albovinaceus, 
Streptomyces alboviridis, Streptomyces albulus, Streptomyces albus subsp. albus, 
Streptomyces albus subsp. pathocidicus, Streptomyces almquistii, Streptomyces 
15 althioticus, Streptomyces amakusaensis, Streptomyces ambofaciens, Streptomyces 
aminophilus, Streptomyces anandii, Streptomyces anthocyanicus, Streptomyces 
antibioticus, Streptomyces antimycoticus, Streptomyces anulatus, Streptomyces 
arabicus, Streptomyces ardus, Streptomyces arenae, Streptomyces argenteolus, 
Streptomyces armeniacus, Streptomyces asiaticus, Streptomyces asterosporus, 
20 Streptomyces atratus, Streptomyces atroaurantiacus, Streptomyces atroolivaceus, 
Streptomyces atrovirens, Streptomyces aurantiacus, Streptomyces aurantlogriseus, 
Streptomyces aureocirculatus, Streptomyces aureofaciens, Streptomyces aureorectus, 
Streptomyces aureoversilis, Streptomyces aureoverticillatus, Streptomyces aureus, 
Streptomyces avellaneus, Streptomyces avermectinius, Streptomyces avermitills, 
25 Streptomyces avidinii, Streptomyces azaticus, Streptomyces azureus, Streptomyces 
baarnensis, Streptomyces bacillaris, Streptomyces badius, Streptomyces baldaccii, 
Streptomyces bambergiensis, Streptomyces beijiangensis, Streptomyces bellus, 
Streptomyces bikiniensis, Streptomyces biverticillatus, Streptomyces blastmyceticus, 
Streptomyces bluensis, Streptomyces bobili, Streptomyces bottropensis, Streptomyces 
30 brasiliensis, Streptomyces bungoensis, Streptomyces cacaoi subsp. asoensis, 

Streptomyces cacaoi subsp. cacaoi, Streptomyces caelestis, Streptomyces caeruleus, 
Streptomyces californicus, Streptomyces calvus, Streptomyces canaries, Streptomyces 
candidus, Streptomyces canescens, Streptomyces cangkringensis, Streptomyces 
caniferus, Streptomyces canus, Streptomyces capillispiralis, Streptomyces capoamus, 
35 Streptomyces carpaticus, Streptomyces carpinensis, Streptomyces catenulae, 
Streptomyces caviscabies, Streptomyces cavburensis subsp. cavourensis, 
Streptomyces cavourensis subsp. washingtonensis, Streptomyces cellostaticus, 
Streptomyces celluloflavus, Streptomyces celluiolyticus, Streptomyces 
cellulosae.Streptomyceschampavatii, Streptomyces chartreuses, Streptomyces 
40 chattanoogensis, Streptomyces chibaensis, Streptomyces chrestomyceticus, 

Streptomyces chromofuscus, Streptomyces chryseus, Streptomyces chrysomallus 
subsp. chrysomallus, Streptomyces chrysomallus subsp. fumigatus, Streptomyces 
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cinereorectus, Streptomyces cinereoruber subsp. cinereoruber, Streptomyces 
cinereoruber subsp. fructofermentans, Streptomyces cinereospinus, Streptomyces 
cinereus, Streptomyces cinerochromogenes, Streptomyces cinnabarinus, 
Streptomyces cinnamonensis, Streptomyces cinnamoneus, Streptomyces 
5 cinnamoneus subsp. albosporus, Streptomyces cinnamoneus subsp. cinnamoneus, 
Streptomyces cinnamoneus subsp. lanosus, Streptomyces cinnamoneus subsp. 
sparsus, Streptomyces cirratus, Streptomyces ciscaucasicus, Streptomyces 
citreofluorescens, Streptomyces clavifer, Streptomyces clavuligerus, Streptomyces 
cochleatus, Streptomyces coelescens, Streptomyces coelicoflavus, Streptomyces 

10 coelicolor, Streptomyces coeruleoflavus, Streptomyces coeruleofuscus, Streptomyces 
coeruleoprunus, Streptomyces coeruleorubidus, Streptomyces coerulescens, 
Streptomyces collinus, Streptomyces colombiensis, Streptomyces corchorusii, 
Streptomyces costaricanus, Streptomyces cremeus, Streptomyces crystaliinus, 
Streptomyces curacoi, Streptomyces cuspidosporus, Streptomyces cyaneofuscatus, 

15 Streptomyces cyaneus, Streptomyces cyanoalbus, Streptomyces cystargineus, 
Streptomyces daghestanicus, Streptomyces diastaticus subsp. ardesiacus, 
Streptomyces diastaticus subsp. diastaticus, Streptomyces diastatochromogenes, 
Streptomyces distallicus, Streptomyces djakartensis, Streptomyces durhamensis, 
Streptomyces echinatus, Streptomyces echinoruber, Streptomyces ederensis, 

20 Streptomyces ehimensis, Streptomyces endus, Streptomyces enissocaesilis, 

Streptomyces erumpens, Streptomyces erythraeus, Streptomyces erythrogriseus, 
Streptomyces eurocidicus, Streptomyces europaeiscabiei, Streptomyces eurythermus, 
Streptomyces exfoliates, Streptomyces felleus, Streptomyces fervens, Streptomyces 
fervens subsp. fervens, Streptomyces fervens subsp. melrosporus, Streptomyces 

25 filamentosus, Streptomyces filipinensis, Streptomyces fimbriatus, Streptomyces 
fimicarius, Streptomyces finlayi, Streptomyces flaveolus, Streptomyces flaveus, 
Streptomyces fiavidofuscus, Streptomyces flavidovirens, Streptomyces fiaviscleroticus, 
Streptomyces flavofungini, Streptomyces flavofuscus, Streptomyces flavogriseus, 
Streptomyces flavopersicus, Streptomyces flavotricini, Streptomyces flavovariabilis, 

30 Streptomyces flavovirens, Streptomyces flavoviridis, Streptomyces flocculus, 

Streptomyces floridae, Streptomyces fluorescens, Streptomyces fradiae, Streptomyces 
fragilis, Streptomyces fulvissimus, Streptomyces fulvorobeus, Streptomyces fumanus, 
Streptomyces fumigatiscleroticus, Streptomyces galbus, Streptomyces galilaeus, 
Streptomyces gancidicus, Streptomyces gardnen, Streptomyces gelaticus, 

35 Streptomyces geysiriensis, Streptomyces ghanaensis, Streptomyces gibsonii, 
Streptomyces glaucescens, Streptomyces glaucosporus, Streptomyces glaucus, 
Streptomyces globisporus subsp. caucasicus, Streptomyces globisporus subsp. 
flavofuscus, Streptomyces globisporus subsp. globisporus, Streptomyces globosus, 
Streptomyces glomeratus, Streptomyces glomeroaurantiacus, Streptomyces gobitricini, 

40 Streptomyces goshikiensis, Streptomyces gougerotii, Streptomyces graminearus, 
Streptomyces graminofaciens, Streptomyces griseinus, Streptomyces 
griseoaurantiacus, Streptomyces griseobrunneus, Streptomyces griseocameus, 
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Streptomyces griseochromogenes, Streptomyces griseoflavus, Streptomyces 
griseofuscus, Streptomyces griseoincarnatus, Streptomyces griseoloalbus, 
Streptomyces griseolosporeus, Streptomyces griseolus, Streptomyces griseoluteus, 
Streptomyces griseomycini, Streptomyces griseoplanus, Streptomyces griseorubens, 
5 Streptomyces griseoruber, Streptomyces griseorubiginosus, Streptomyces 

gns&osporeus, streptomyces gm>ttuaucuiiuiGuo, ouo/jium/ovo y>i«»wr>«u V >>i U ».i 
Streptomyces griseoviridis, Streptomyces griseus subsp. alpha, Streptomyces griseus 
subsp. cretosus, Streptomyces griseus subsp. griseus, Streptomyces griseus subsp. 
solvifaciens, Streptomyces hachijoensis, Streptomyces halstedii, Streptomyces 
10 hawaiiensis, Streptomyces heliomycini, Streptomyces helvaticus, Streptomyces 
herbaricolor, Streptomyces hiroshimensis, Streptomyces hirsutus, Streptomyces 
humidus, Streptomyces humiferus, Streptomyces hydrogenans, Streptomyces 
hygroscopicus subsp. angustmyceticus, Streptomyces hygroscopicus subsp. 
decoyicus, Streptomyces hygroscopicus subsp. glebosus, Streptomyces hygroscopicus 
15 subsp. hygroscopicus, Streptomyces hygroscopicus subsp. ossamyceticus, 
Streptomyces iakyrus, Streptomyces indiaensis, Streptomyces indigoferus, 
Streptomyces indonesiensis, Streptomyces intermedius, Streptomyces inusitatus, 
Streptomyces ipomoeae, Streptomyces janthinus, Streptomyces javensis, 
Streptomyces kanamyceticus, Streptomyces kashmirensis, Streptomyces kasugaensis, 
20 Streptomyces katrae, Streptomyces kentuckensis, Streptomyces kifunensis, 

Streptomyces kishiwadensis, Streptomyces kunmingensis, Streptomyces kurssanovii, 
Streptomyces labedae, Streptomyces laceyi, Streptomyces ladakanum, Streptomyces 
lanatus, Streptomyces lateritius, Streptomyces laurentii, Streptomyces lavendofoliae, 
Streptomyces lavendulae subsp. grasserius, Streptomyces lavendulae subsp. 
25 lavendulae, Streptomyces lavenduligriseus, Streptomyces lavendulocolor, 

Streptomyces tews, Streptomyces libani subsp. libani, Streptomyces libani subsp. 
rufus, Streptomyces lienomycini, Streptomyces lilacinus, Streptomyces limosus, 
Streptomyces lincolnensis, Streptomyces lipmanii, Streptomyces litmocidini, 
Streptomyces lomondensis, Streptomyces longisporoflavus, Streptomyces 
30 longispororuber, Streptomyces longisporus, Streptomyces longwoodensis, 
Streptomyces lucensis, Streptomyces luridiscabiei, Streptomyces luridus, 
Streptomyces lusitanus, Streptomyces luteireticuli, Streptomyces luteogriseus, 
Streptomyces luteosporeus, Streptomyces luteoverticillatus, Streptomyces lydicus, 
Streptomyces macrosporus, Streptomyces malachitofuscus, Streptomyces 
35 malachitospinus, Streptomyces malaysiensis, Streptomyces mashuensis, 

Streptomyces massasporeus, Streptomyces matensis, Streptomyces mauvecolor, 
Streptomyces mediocidicus, Streptomyces mediolani, Streptomyces megasporus, 
Streptomyces melanogenes, Streptomyces melanosporofaciens, Streptomyces 
mexicanus, Streptomyces michiganensis, Streptomyces microflavus, Streptomyces 
40 minutiscleroticus, Streptomyces mirabilis, Streptomyces misakiensis, Streptomyces 
misionensis, Streptomyces mobaraensis, Streptomyces monomycini, Streptomyces 
morookaensis, Streptomyces murinus, Streptomyces mutabilis, Streptomyces 
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mutomycini, Streptomyces naganishii, Streptomyces narbonensis, Streptomyces 
nashvillensis, Streptomyces netropsis, Streptomyces neyagawaensis, Streptomyces 
niger, Streptomyces nigrescens, Streptomyces nigrifaciens, Streptomyces 
nitrosporeus, Streptomyces niveiciscabiei, Streptomyces niveoruber, Streptomyces 
5 niveus, Streptomyces noboritoensis, Streptomyces nodosus, Streptomyces nogalater, 
Streptomyces nojiriensis, Streptomyces noursei, Streptomyces novaecaesareae, 
Streptomyces ochraceiscleroticus, Streptomyces odorifer, Streptomyces 
olivaceiscleroticus, Streptomyces olivaceoviridis, Streptomyces olivaceus, 
Streptomyces olivochromogenes, Streptomyces olivomycin^ Streptomyces olivoreticuli, 

10 Streptomyces olivoreticuli subsp. cellulophilus, Streptomyces olivoreticuli subsp. 

olivoreticuli, Streptomyces olivoverticillatus, Streptomyces olivoviridis, Streptomyces 
omiyaensis, Streptomyces orinoci, Streptomyces pactum, Streptomyces 
paracochleatus, Streptomyces paradoxus, Streptomyces parvisporogenes, 
Streptomyces parvulus, Streptomyces parvus, Streptomyces peucetius, Streptomyces 

1 5 phaeochromogenes, Streptomyces phaeofaciens, Streptomyces phaeopurpureus, 
Streptomyces phaeoviridis, Streptomyces phosalacineus, Streptomyces pilosus, 
Streptomyces platensis, Streptomyces plicatus, Streptomyces pluricolorescens, 
Streptomyces polychromogenes, Streptomyces poonensis, Streptomyces praecox, 
Streptomyces prasinopilosus, Streptomyces prasinosporus, Streptomyces prasinus, 

20 Streptomyces prunicolor, Streptomyces psammoticus, Streptomyces 
pseudoechinosporeus, Streptomyces pseudogriseolus, Streptomyces 
pseudovenezuelae, Streptomyces pulveraceus, Streptomyces puniceus, Streptomyces 
puniciscabiei, Streptomyces purpeofuscus, Streptomyces purpurascens, Streptomyces 
purpureus, Streptomyces purpurogeneiscleroticus, Streptomyces 

25 racemochromogenes, Streptomyces rameus, Streptomyces ramulosus, Streptomyces 
rangoonensis, Streptomyces recifensis, Streptomyces rectiverticillatus, Streptomyces 
rectiviolaceus, Streptomyces regensis, Streptomyces resistomycificus, Streptomyces 
reticuliscabiei, Streptomyces rhizosphaericus, Streptomyces rimosus subsp. 
paromomycinus, Streptomyces rimosus subsp. rimosus, Streptomyces rishiriensis, 

30 Streptomyces rochei, Streptomyces roseiscleroticus, Streptomyces roseodiastaticus, 
Streptomyces roseoflavus, Streptomyces roseofulvus, Streptomyces roseolilacinus, 
Streptomyces roseolus, Streptomyces roseosporus, Streptomyces roseoverticillatus, 
Streptomyces roseoviolaceus, Streptomyces roseoviridis, Streptomyces rubber, 
Streptomyces rubiginosohelvolus, Streptomyces rubiginosus, Streptomyces 

35 rubrogriseus, Streptomyces rutgersensis subsp. castelarensis, Streptomyces 

rutgersensis subsp. rutgersensis, Streptomyces salmonis, Streptomyces sampsonii, 
Streptomyces sanglieri, Streptomyces sannanensis, Streptomyces sapporonensis, 
Streptomyces scabiei, Streptomyces sclerotialus, Streptomyces scopiformis, 
Streptomyces seoulensis, Streptomyces septatus, Streptomyces setae, Streptomyces 

40 setonii, Streptomyces showdoensis, Streptomyces sindenensis, Streptomyces 

sioyaensis, Streptomyces somaliensis, Streptomyces sparsogenes, Streptomyces 
spectabilis, Streptomyces speibonae, Streptomyces speleomycini, Streptomyces 
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spheroids, Streptomyces spinoverrucosus, Streptomyces spiralis, Streptomyces 
spiroverticillatus, Streptomyces spitsbergensis, Streptomyces sporocinereus, 
Streptomyces sporoclivatus, Streptomyces spororaveus, Streptomyces 
sporoverrucosus, Streptomyces stelliscabiei, Streptomyces stramineus, Streptomyces 
5 subrutilus, Streptomyces sulfonofaciens, Streptomyces sulphurous, Streptomyces 

synngium, £>irepiomyuttz> iaua&i i;c;<o»o, uuo^iuih/v/Boiumi.—) / ■> — - 

Streptomyces termitum, Streptomyces thermoalcalitolerans, Streptomyces 
thermoautotrophicus, Streptomyces thermocarboxydovorans, Streptomyces 
thermocarboxydus, Streptomyces thermocoprophilus, Streptomyces thermodiastaticus, 
10 Streptomyces thermogriseus, Streptomyces thermolineatus, Streptomyces 

thermonitrificans, Streptomyces thermospinosisporus, Streptomyces thermoviolaceus 
subsp. apingens, Streptomyces thermoviolaceus subsp. thermoviolaceus, 
Streptomyces thermovulgaris, Streptomyces thioluteus, Streptomyces torulosus, 
Streptomyces toxytricini, Streptomyces tricolor, Streptomyces tubercidicus, 
15 Streptomyces tuirus, Streptomyces turgidiscabies, Streptomyces umbrinus, 
Streptomyces variabilis, Streptomyces variegates, Streptomyces varsoviensis, 
Streptomyces vastus, Streptomyces venezuelae, Streptomyces vinaceus, 
Streptomyces vinaceusdrappus, Streptomyces violaceochromogenes, Streptomyces 
violaceolatus,Streptomyces violaceorectus, Streptomyces violaceoruber, Streptomyces 
20 violaceprubidus, Streptomyces violaceus, Streptomyces violaceusniger, Streptomyces 
violarus, Streptomyces violascens, Streptomyces violatus, Streptomyces violens, 
Streptomyces virens, Streptomyces virginiae, Streptomyces viridiflavus, Streptomyces 
viridiviolaceus, Streptomyces viridobrunneus, Streptomyces viridochromogenes, 
Streptomyces viridodiastaticus, Streptomyces viridosporus, Streptomyces 
25 vitaminophileus, Streptomyces vitaminophilus, Streptomyces wedmorensis, 

Streptomyces werraensis, Streptomyces willmorei, Streptomyces xanthochromogenes, 
Streptomyces xanthocidicus, Streptomyces xantholiticus, Streptomyces xanthophaeus, 
Streptomyces yatensis, Streptomyces yerevanensis, Streptomyces yogyakartensis, 
Streptomyces yokosukanensis, Streptomyces yunnanensis, Streptomyces 
30 zaomyceticus, Streptoverticillium abikoense, Streptoverticillium albireticuli, 

Streptoverticillium alboverticillatum, Streptoverticillium album, Streptoverticillium ardum, 
Streptoverticillium aureoversale, Streptoverticillium aureoversile, Streptoverticillium 
baldaccii , Streptoverticillium biverticillatum, Streptoverticillium blastmyceticum, 
Streptoverticillium cinnamoneum subsp. albosporum, Streptomyces cinnamoneus 
35 subsp. albosporus, Streptoverticillium cinnamoneum subsp. cinnamoneum, 

■ Streptoverticillium cinnamoneum subsp. tanosum, Streptoverticillium cinnamoneum 
subsp. sparsum, Streptoverticillium distallicum, Streptoverticillium ehimense, 
Streptoverticillium eurocidicum, Streptoverticillium fervens subsp. fervens, 
Streptoverticillium fervens subsp. melrosporus, Streptoverticillium flavopersicum, 
40 Streptoverticillium griseocarneum, Streptoverticillium griseoverticillatum, 

Streptoverticillium hachijoense, Streptoverticillium hiroshimense, Streptoverticillium 
kashmirense, Streptoverticillium kentuckense, Streptoverticillium kishiwadense, 
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Streptoverticillium ladakanum, Streptoverticillium lavenduligriseum, Streptoverticillium 
lilacinum, Streptoverticillium luteoverticillatum, Streptoverticillium mashuense, 
Streptoverticillium mobaraense, Streptoverticillium morookaense, Streptoverticillium 
netropsis, Streptoverticillium olivomycin!', Streptomyces olivomycin!' t Streptoverticillium 
5 olivoreticuli subsp. cellulophilum, Streptoverticillium olivoreticuli subsp. olivoreticuli, 
Streptoverticillium olivoreticulum, Streptoverticillium olivoreticulum subsp. 
cellulophilum, Streptoverticillium olivoverticillatum, Streptoverticillium orinoci, 
Streptoverticillium parvisporogenes, Streptoverticillium parvisporogenum, 
Streptoverticillium rectiverticillatum, Streptoverticillium reticulum subsp. protomycicum, 
10 Streptoverticillium roseoverticillatum, Streptoverticillium salmonis, Streptoverticillium 
sapporonense, Streptoverticillium septatum, Streptoverticillium syringium, 
Streptoverticillium thioluteum, Streptoverticillium verticillium subsp. quantum, 
Streptoverticillium verticillium subsp. tsukushiense or Streptoverticillium viridoflavum. 

[0092.0.0.0] Particular preferred strains are strains selected from the group 
15 consisting of Bacillaceae, Brevibacteriaceae, Corynebacteriaceae, Nocardiaceae, 

Mycobacteriaceae, Streptomycetaceae, Enterobacteriaceae such as Bacillus circulans, 
Bacillus subtilis, Bacillus sp., Brevibacterium albidum, Brevibacterium album, 
Brevibacterium cerinum, Brevibacterium flavum, Brevibacterium glutamigenes, 
Brevibacterium iodinum, Brevibacterium ketoglutamicum, Brevibacterium 
20 lactofermentum, Brevibacterium linens, Brevibacterium roseum, Brevibacterium 
saccharolyticum, Brevibacterium sp., Corynebacterium acetoacidophilum, 
Corynebacterium acetoglutamicum, Corynebacterium ammoniagenes, 
Corynebacterium glutamicum (= Micrococcus glutamicum), Corynebacterium 
melassecola, Corynebacterium sp., Nocardia rhodochrous (Rhodococcus 
25 rhodochrous), Mycobacterium rhodochrous, Streptomyces lividans and Escherichia coli 
especially Escherichia coli K1 2. 

[0093.0.0.0] In addition particular preferred strains are strains selected from the 
group consisting of Cryptococcaceae, Saccharomycetaceae, Schizosaccharo- 
mycetacease such as the genera Candida, Hansenula, Pichia, Saccharomyces and 

30 Schizosaccharomyces preferred are strains selected from the group consisting of the 
species Rhodotorula rubra, Rhodotorula glutinis, Rhodotorula graminis, Yarrowia 
lipolytica, Sporobolomyces salmonicolor, Sporobolomyces shibatanus, Saccharomyces 
cerevisiae, Candida boidinii, Candida bombicola, Candida cylindracea, Candida 
parapsilosis, Candida rugosa, Candida tropicalis, Pichia methanolica and Pichia 

35 pastoris especially Saccharomyces cerevisiae. 

[0094.0.0.0] Anacardiaceae such as the genera Pistacia, Mangifera, Anacardium e.g. 
the species Pistacia vera [pistachios, Pistazie], Mangifer indica [Mango] or Anacardium 
occidentale [Cashew]; Asteraceae such as the genera Calendula, Carthamus, 
Centaurea, Cichorium, Cynara, Helianthus, Lactuca, Locusta, Tagetes, Valeriana e.g. 
40 the species Calendula officinalis [Marigold], Carthamus tinctorius [safflower], 
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Centaurea cyanus [cornflower], Cichorium intybus [blue daisy], Cynara scolymus 
[Artichoke], Helianthus annus [sunflower], Lactuca sativa, Lactuca crispa, Lactuca 
esculenta, Lactuca scariola L ssp. sativa, Lactuca scahola L var. integrate, Lactuca 
scariola L. var. integrifolia, Lactuca sativa subsp. romana, Locusta communis, 
5 Valeriana locusta [lettuce], Tagetes lucida, Tagetes erecta or Tagetes tenuifolia 
[Marigold]; Apiaceae such as the genera Daucus e.g. the species Daucus carota 
[carrot]; Betulaceae such as the genera Corylus e.g. the species Corylus avellana or 
Corylus colurna [hazelnut]; Boraginaceae such as the genera Borago e.g. the species 
Borago officinalis [borage]; Brassicaceae such as the genera Brassica, Melanosinapis, 
10 Sinapis, Arabadopsis e.g. the species Brassica napus, Brassica rapa ssp. [canola, 
oilseed rape, turnip rape], Sinapis arvensis Brassica juncea, Brassica juncea var. 
juncea, Brassica juncea var. crisp'rfolia, Brassica juncea var. foliosa, Brassica nigra, 
Brassica sinapioides, Melanosinapis communis [mustard], Brassica oleracea [fodder 
beet] or Arabidopsis thaiiana; Bromeliaceae such as the genera Anana, Bromelia e.g. 
1 5 the species Anana comosus, Ananas ananas or Bromelia comosa [pineapple]; 
Caricaceae such as the genera Carica e.g. the species Carica papaya [papaya]; 
Cannabaceae such as the genera Cannabis e.g. the species Cannabis sative [hemp], 
Convolvulaceae such as the genera Ipomea, Convolvulus e.g. the species Ipomoea 
batatus, Ipomoea pandurata, Convolvulus batatas, Convolvulus tiliaceus, Ipomoea 
20 fastigiata, Ipomoea t Hi ace a, Ipomoea triloba or Convolvulus panduratus [sweet potato, 
Man of the Earth, wild potato], Chenopodiaceae such as the genera Beta, i.e. the 
species Beta vulgaris, Beta vulgaris var. altissima, Beta vulgaris var. Vulgaris, Beta 
mafAima, Beta vulgaris var. perennis, Beta vulgaris var. conditiva or Beta vulgaris var. 
esculenta [sugar beet]; Cucurbitaceae such as the genera Cucubita e.g. the species 
25 Cucurbita maxima, Cucurbita mixta, Cucurbita pepo or Cucurbita moschata [pumpkin, 
squash]; Elaeagnaceae such as the genera Elaeagnus e.g. the species Olea europaea 
[olive]; Ericaceae such as the genera Kalmia e.g. the species Kalmia latifolia, Kalmia 
angustifolia, Kalmia microphylla, Kalmia polifolia, Kalmia occidentalis, Cistus 
chamaerhodendros or Kalmia lucida [American laurel, broad-leafed laurel, calico bush, 
30 spoon wood, sheep laurel, alpine laurel, bog laurel, western bog-laurel, swamp-laurel]; 
Euphorbiaceae such as the genera Manihot, Janipha, Jatropha, Ricinus e.g. the 
species Manihot utilissima, Janipha manihot,, Jatropha manihot, Manihot aipil, Manihot 
dulcis, Manihot manihot, Manihot melanobasis, Manihot esculenta [manihot, arrowroot, 
tapioca, cassava] or Ricinus communis [castor bean, Castor Oil Bush, Castor Oil Plant, 
35 Palma Christi, Wonder Tree]; Fabaceae such as the genera Pisum, Albizia, 

Cathormion, Feuillea, Inga, Pithecolobium, Acacia, Mimosa, Medicajo, Glycine, 
Dolichos, Phaseolus, Soja e.g. the species Pisum sativum, Pisum arvense, Pisum 
humile [pea], Albizia berteriana, Albizia julibrissin, Albizia lebbeck, Acacia berteriana, 
Acacia littoralis, Albizia berteriana, Albizzia berteriana, Cathormion berteriana, Feuillea 
40 berteriana, Inga fragrans, Pithecellobium berterianum, Pithecellobium fragrans, 

Pithecolobium berterianum, Pseudalbizzia berteriana, Acacia julibrissin, Acacia nemu, 
Albizia nemu, Feuilleea julibrissin, Mimosa julibrissin, Mimosa speciosa, Sericanrda 
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julibrissin, Acacia lebbeck, Acacia macrophylla, Albizia lebbek, Feuilleea lebbeck, 
Mimosa lebbeck, Mimosa speciosa [bastard logwood, silk tree, East Indian Walnut], 
Medicago sativa, Medicago falcata, Medicago varia [alfalfa] Glycine max Dolichos soja, 
Glycine gracilis, Glycine hispida, Phaseolus max, Soja hispida or Soja max [soybean]; 
5 Geraniaceae such as the genera Pelargonium, Cocos, Oleum e.g. the species Cocos 
nucifera, Pelargonium grossularioides or Oleum cocois [coconut]; Gramineae such as 
the genera Saccharum e.g. the species Saccharum officinarum; Juglandaceae such as 
the genera Juglans, Wallia e.g. the species Juglans regie, Juglans ailanthifolia, Juglans 
sieboldiana, Juglans cinerea, Wallia cinerea, Juglans bixbyi, Juglans califomica, 

1 0 Juglans hindsii, Juglans intermedia, Juglans jamaicensis, Juglans major, Juglans 
microcarpa, Juglans nigra or Wallia nigra [walnut, black walnut, common walnut, 
persian walnut, white walnut, butternut, black walnut]; Lauraceae such as the genera 
Persea, Laurus e.g. the species laurel Laurus nobilis [bay, laurel, bay laurel, sweet 
bay], Persea americana Persea americana, Persea gratissima or Persea persea 

1 5 [avocado]; Leguminosae such as the genera Arachis e.g. the species Arachis 

hypogaea [peanut]; Linaceae such as the genera Linum, Adenolinum e.g. the species 
Linum usitatissimum, Linum humile, Linum austriacum, Linum bienne, Linum 
angustifolium, Linum catharticum, Linum flavum, Linum grandiflorum, Adenolinum 
grandiflorum, Linum lewisii, Linum narbonense, Linum perenne, Linum perenne van 

20 lewisii, Linum pratense or Linum trigynum [flax, linseed]; Lythrarieae such as the 

genera Punica e.g. the species Punica granatum [pomegranate]; Malvaceae such as 
the genera Gossypium e.g. the species Gossypium hirsutum, Gossypium arboreum, 
Gossypium barbadense, Gossypium herbaceum or Gossypium thurberi [cotton]; 
Musaceae such as the genera Musa e.g. the species Musa nana, Musa acuminata, 

25 Musa paradisiaca, Musa spp. [banana]; Onagraceae such as the genera Camissonia, 
Oenothera e.g. the species Oenothera biennis or Camissonia brevipes [primrose, 
evening primrose]; Palmae such as the genera Elacis e.g. the species Elaeis 
guineensis [oil plam]; Papaveraceae such as the genera Papaver e.g. the species 
Papaver orientate, Papaver rhoeas, Papaver dubium [poppy, oriental poppy, corn 

30 poppy, field poppy, Shirley poppies, field poppy, long-headed poppy, long-pod poppy]; 
Pedaliaceae such as the genera Sesamum e.g. the species Sesamum indicum 
[sesame]; Piperaceae such as the genera Piper, Artanthe, Peperomia, Steffensia e.g. 
the species Piper aduncum, Piper amalago, Piper angustifolium, Piper auritum, Piper 
betel, Piper cubeba, Piper longum, Piper nigrum, Piper retrofractum, Artanthe adunca, 

35 Artanthe elongata, Peperomia elongata, Piper elongatum, Steffensia elongata. 

[Cayenne pepper, wild pepper]; Poaceae such as the genera Hordeum, Secale, Avena, 
Sorghum, Andropogon, Holcus, Panicum, Oryza, Zea, Triticum e.g. the species 
Hordeum vulgare, Hordeum jubatum, Hordeum murinum, Hordeum secalinum, 
Hordeum distichon Hordeum aegiceras, Hordeum hexastichon., Hordeum 

40 hexastichum, Hordeum irregulare, Hordeum sativum, Hordeum secalinum [barley, 
pearl barley, foxtail barley, wall barley, meadow barley], Secale cereale [rye], Avena 
sativa, Avena fatua, Avena byzantina, Avena fatua van sativa, Avena hybrida [oat], 
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Sorghum bicolor, Sorghum halepense, Sorghum saccharatum, Sorghum vulgare, 
Andropogon drummondii, Holcus bicolor, Holcus sorghum, Sorghum aethiopicum, 
Sorghum arundinaceum, Sorghum caffrorum, Sorghum cernuum, Sorghum dochna, 
Sorghum drummondii, Sorghum durra, Sorghum guineense, Sorghum lanceolatum, 
5 Sorghum nervosum, Sorghum saccharatum, Sorghum subglabrescens, Sorghum 
verticiiiiflorjm, Sorghum vulgare, Holcus halepensis, Sorghum miliaceum millet, 
Panicum militaceum [Sorghum, millet], Oryza sativa, Oryza latifolia [rice], Zea mays 
[corn, maize] Triticum aestivum, Triticum durum, Triticum turgidum, Triticum hybernum, 
Triticum macha, Triticum sativum or Triticum vulgare [wheat, bread wheat, common 
10 wheat], Proteaceae such as the genera Macadamia e.g. the species Macadamia 

intergrifolia [macadamia]; Rubiaceae such as the genera Coffea e.g. the species Cofea 
spp., Coffea arabica, Coffea canephora or Coffea liberica [coffee]; Scrophulariaceae 
such as the genera Verbascum e.g. the species Verbascum blattaria, Verbascum 
chaixii, Verbascum densiflorum, Verbascum lagurus, Verbascum longifolium, 
15 Verbascum lychnitis, Verbascum nigrum, Verbascum olympicum, Verbascum 

phlomoides, Verbascum phoenicum, Verbascum pulverulentum or Verbascum thapsus 
[mullein, white moth mullein, nettle-leaved mullein, dense-flowered mullein, silver 
mullein, long-leaved mullein, white mullein, dark mullein, greek mullein, orange mullein, 
purple mullein, hoary mullein, great mullein]; Solanaceae such as the genera 
20 Capsicum, Nicotiana, Solanum, Lycopersicon e.g. the species Capsicum annuum, 
Capsicum annuum var. glabriusculum, Capsicum frutescens [pepper], Capsicum 
annuum [paprika], Nicotiana tabacum, Nicotiana alata, Nicotiana attenuata, Nicotiana 
glauca, Nicotiana langsdorffii, Nicotiana obtusifolia, Nicotiana quadrivalvis, Nicotiana 
repanda, Nicotiana rustica, Nicotiana sylvestris [tobacco], Solanum tuberosum [potato], 
25 Solanum melongena [egg-plant] {Lycopersicon esculentum, Lycopersicon 
lycopersicum., Lycopersicon pyriforme, Solanum integrifolium or Solanum 
lycopersicum [tomato]; Sterculiaceae such as the genera Theobroma e.g. the species 
Theobroma cacao [cacao]; Theaceae such as the genera Camellia e.g. the species 
Camellia sinensis) [tea]. 
30 All abovementioned organisms can in princible also function as host organisms. 

[0095.0.0.0] Particular preferred plants are plants selected from the group consisting 
of Asteraceae such as the genera Helianthus, Tagetes e.g. the species Helianthus 
annus [sunflower], Tagetes lucida, Tagetes erecta or Tagetes tenuifolia [Marigold]; 
Brassicaceae such as the genera Brassica, Arabadopsis e.g. the species Brassica 

35 napus, Brassica rape ssp., Brassica juncea [canola, oilseed rape, turnip rape] or 

Arabidopsis thaliana; Fabaceae such as the genera Glycine e.g. the species Glycine 
max, Soja hispida or Soja max [soybean]; Linaceae such as the genera Linum e.g. the 
species Linum usitatissimum, [flax, linseed]; Poaceae such as the genera Hordeum, 
Secale, Avena, Sorghum, Oryza, Zea, Triticum e.g. the species Hordeum vulgare 

40 " [barley]; Secale cereale [rye], Avena sativa, Avena fatua, Avena byzantina, Avena 

fatua var. sativa, Avena hybrida [oat], Sorghum bicolor [Sorghum, millet], Oryza sativa, 
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Oryza latifolia [rice], Tea mays [corn, maize] Triticum aestivum, Triticum durum, 
Triticum turgidum, Triticum hybernum, Triticum macha, Triticum sativum or Triticum 
vulgare [wheat, bread wheat, common wheat]; Solanaceae such as the genera 
Solanum, Lycopersicon e.g. the species Solanum tuberosum [potato], Lycopersicon 
5 esculentum, Lycopersicon lycopersicum., Lycopersicon pyriforme, Solanum 
integrifolium or Solanum lycopersicum [tomato]. 

[0096.0.0.0] All abovementioned organisms can in princible also function as host 
organisms. 

[0097.0.0.0] With regard to the nucleic acid sequence as depicted below a nucleic 
10 acid construct which contains a nucleic acid sequence mentioned herein or an 
organism (= transgenic organism) which is transformed with said nucleic acid 
sequence or said nucleic acid construct, "transgene" means all those constructs which 
have been brought about by genetic manipulation methods, preferably in which either 

a) the nucleic acid sequence as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 
15 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 

69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147. 149, 151, 153, 155, 157, 159, 161, 163, 165. 167, 169, 171, 
173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 

20 205, 207, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 

237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 
269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 
301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325. 327, 329, 331, 
333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 

25 365. 367, 369, 371, 373, 375, 377, 379, 381 , 383, 385, 387, 389, 391or 393 or 

a derivative thereof, or 

b) a genetic regulatory element, for example a promoter, which is functionally linked 
to the nucleic acid sequence as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 
17, 19, 21, 23, 25. 27. 29. 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 

30 67, 69, 71, 73, 75, 77, 79. 81, 83. 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105. 

107, 109, 111, 113, 115, 117, 119. 121, 123. 125, 127, 129, 131, 133, 135, 137. 
139, 141. 143, 145, 147, 149, 151, 153, 155,157. 159. 161, 163, 165, 167, 169, 
171, 173, 175, 177, 179, 181. 183. 185, 187. 189, 191, 193, 195. 197, 199, 201, 
203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 

35 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 

267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 
299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327. 329, 
331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 
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363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391or 393 
or a derivative thereof, or 

c) (a) and (b) 

is/are not present in its/their natural genetic environment or has/have been modified by 
5 means of genetic manipulation methods, it being possible for the modification to be, by 
way of example, a substitution, addition, deletion, inversion or insertion of one or more 
nucleotide. "Natural genetic environmenf means the natural chromosomal locus in the 
organism of origin or the presence in a genomic library. In the case of a genomic 
library, the natural, genetic environment of the nucleic acid sequence is preferably at 
10 least partially still preserved. The environment flanks the nucleic acid sequence at least 
on one side and has a sequence length of at least 50 bp, preferably at least 500 bp, 
particularly preferably at least 1000 bp, very particularly preferably at least 5000 bp. 

[0098.0.0.0] The use of the nucleic acid sequence according to the invention or of 
the nucleic acid construct according to the invention for the generation of transgenic 
1 5 plants is therefore also subject matter of the invention. 

[0099.0.0.0] The fine chemical, which is synthesized in the organism, in particular the 
microorganism, the cell, the tissue or the plant, of the invention can be isolated if 
desired. Depending on the use of the fine chemical, different purities resulting from the 
purification may be advantageous as will be described herein below. 

20 [01 00.0.0.0] In an advantageous embodiment of the invention, the organism takes 
the form of a plant whose fine chemical content is modified advantageously owing to 
the nucleic acid molecule of the present invention expressed. This is important for plant 
breeders since, for example, the nutritional value of organisms such as a plant is very 
often limited by its amino acid, protein, co-factor and/or vitamin content to mention only 

25 a couple of them. For example in feed for monogastric animals a few essential amino 
acids such as lysine, threonine or methionine are very often limiting. After the biological 
activity of the nucleic acid and/or protein of the invention has been increased or 
generated, or after the expression of nucleic acid molecule or polypeptide according to 
the invention has been generated or increased, the transgenic plant generated thus is 

30 grown on or in a nutrient medium or else in the soil and subsequently harvested. 

[0101 .0.0.0] The plants or parts thereof, e.g. the leaves, roots, flowers, and/or stems 
and/or other harvestable material as described below, can then be used directly as 
foodstuffs or animal feeds or else be further processed. Again, the amino acids can be 
purified further in the customary manner via extraction and precipitation or via ion 
35 exchangers and other methods known to the person skilled in the art and described 
herein below. Products which are suitable for various applications and which result 
from these different processing procedures are for example amino acids or amino acid 
compositions which can still comprise further plant components in different amounts, 
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advantageously in the range of from 0 to 99% by weight or more, preferably from below 
90%, 80%, 70%, 60% or 50% by weight, especially preferably below 40%, 30%, 20% 
or 10% by weight. The plants can also advantageously be used directly without further 
processing, e.g. as feed or for extraction. 

5 [0102.0.0.0] The chemically pure fine chemical or chemically pure compositions 
comprising the fine chemical may also be produced by the process described above. 
To this end, the fine chemical or the compositions are isolated in the known manner 
from an organism according to the invention, such as the microorganisms, non-human 
animal or the plants, and/or their culture medium in which or on which the organisms 
1 0 .had been grown. These chemically pure fine chemical or said compositions, are 

advantageous for applications in the field of the food industry, the cosmetics industry or 
the pharmaceutical industry. 

[0103.0.0.0] Thus, the content of plant components and preferably also further 
impurities is as low as possible, and the abovementioned fine chemical are obtained in 
15 as pure form as possible. In these applications, the content of plant components 

advantageously amounts to less than 10% by weight, preferably 1% by weight, more 
preferably 0.1 % by weight, very especially preferably 0.01 % by weight or less. 

[0104.0.0.0] Accordingly, the fine chemical produced by the present invention is at 
least 0,1% by weight pure, preferably more than 1% by weight pure, more preferred 
20 10% by weight pure, even more preferred are more than 50%, 60%, 70% or 80% by 
weight pure, even more preferred are more than 90%, 91%, 92%, 93%, 94% or 95% 
weight pure, most preferred are 96%, 97%, 98% or 99% by weight or more pure. 

[0105.0.0.0] In this context, the amount of the fine chemical in a cell of the invention 
may be increased according to the process of the invention by at least a factor of 1 .1 , 
25 preferably at least a factor of 1 .5; 2; or 5, especially preferably by at least a factor of 1 0 
or 30, very especially preferably by at least a factor of 50, in comparison with the wild 
type, control or reference. Preferrably, said increase is found in a tissue of an 
organism, more preferred in the organism itself or in a harvestable part thereof. 

[0106.0.0.0] In principle, the fine chemicals produced can be increased in two ways 
30 by the process according to the invention. The pool of free fine chemicals, in particular 
of the free fine chemical, and/or the content of bound for example protein-bound fine 
chemicals, in particular of the protein-bound fine chemical may advantageously be 
increased. 

[0107.0.0.0] It may be advantageous to increase the pool of free fine chemical in the 
35 transgenic organisms by the process according to the invention in order to isolate high 
amounts of the pure fine chemical. 
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[01 08.0.0.0] In another preferred embodiment of the invention a combination of the 
increased expression of the nucleic acid sequence or the protein of the invention 
together with the transformation of a protein or polypeptid, which functions as a sink for 
the desired fine chemical such as an amino acid for example methionine, lysine or 
5 threonine in the organism is useful to increase the production of the fine chemical (see 
US 5,589,616, WO 96/38574, WO 97/07665, WO 97/28247, US 4,886,878, US 
5,082,993 and US 5,670,635). Galili et al. [Transgenic Res. 2000] showed, that 
enhancing the synthesis of threonine by a feed back insensitive aspertate kinase didnot 
lead only to in increase in free threonine but also in protein bound threonine. 

10 [0109.0.0.0] In may also be advantageous to increase the content of the bound fine 
chemical. 

[01 10.0.0.0] In a preferred embodiment, the fine chemical is produced in accordance 
with the invention and, if desired, is isolated. The production of further fine chemicals 
for example amino acids such as lysine and of amino acid mixtures by the process 
15 according to the invention is advantageous. 

[0111 .0.0.0] In the case of the fermentation of microorganisms, the abovementioned 
fine chemical e.g. amino acid or mixtures of amino acids may accumulate in the 
medium and/or the cells. If microorganisms are used in the process according to the 
invention, the fermentation broth can be processed after the cultivation. Depending on 

20 the requirement, all or some of the biomass can be removed from the fermentation 
broth by separation methods such as, for example, centrifugation, filtration, decanting 
or a combination of these methods, or else the biomass can be left in the fermentation 
broth. The fermentation broth can subsequently be reduced, or concentrated, with the 
aid of known methods such as, for example, rotary evaporator, thin-layer evaporator, 

25 falling film evaporator, by reverse osmosis or by nanofiltration. This concentrated 
fermentation broth can subsequently be processed by lyophilization, spray drying, 
spray granulation or by other methods. 

[011 2.0.0.0] To purify a fine chemical such as an amino acid, a product-containing 
fermentation broth from which the biomass has been separated may be subjected to 

30 chromatography with a suitable resin such as ion exchange resin for example anion or 
cation exchange resin, hydrophobic resin or hydrophilic resin for example epoxy resin, 
polyurethane resin or polyacrylamid resin, or resin for separation according to the 
molecular weight of the compounds for example polyvinyl chloride homopolymer resin 
or resins composed for example of polymers of acrylic acid, crosslinked with 

35 polyalkenyl ethers or divinyl glycol such as Carbopol®, Pemulen® and Noveon®. If 
necessary these chromatography steps may be repeated using the same or other 
chromatography resins. The skilled worker is familiar with the choice of suitable 
chromatography resins and their most effective use. The purified product may be 



WO 2005/014828 PCT/EP2004/008136 

58 

concentrated by filtration or ultrafiltration and stored at a temperature, which ensures 
the maximum stability of the product 

[0113.0.0.0] The identity and purity of the compound(s) isolated can be determined 
5 by prior-art techniques. They encompass high-performance liquid chromatography 
(HPLC), spectroscopic methods, mass spectrometry (MS), staining methods, thin-layer 
chromatography, NIRS, enzyme assays or microbiological assays. These analytical 
methods are compiled in: Patek et al. (1994) Appl. Environ. Microbiol. 60:133-140; 
Malakhova et al. (1996) Biotekhnologiya 1 1 27-32; and Schmidt et al. (1998) 

10 Bioprocess Engineer. 19:67-70. Ulmann's Encyclopedia of Industrial Chemistry (1996) 
Bd. A27, VCH Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, pp. 559-566, 575-581 
and pp. 581-587; Michal, G (1999) Biochemical Pathways: An Atlas of Biochemistry 
and Molecular Biology, John Wiley and Sons; Fallon, A. et al. (1987) Applications of 
HPLC in Biochemistry in: Laboratory Techniques in Biochemistry and Molecular Bio- 

15 logy, vol. 17. 

[01 14.0.0.0] Fine chemicals like amino acids can for excample be detected 
advantageously via HPLC separation in ethanolic extract as described by 
Geigenberger et al. (Plant Cell & Environ, 19, 1996: 43-55). Amino acids can be 
extracted with hot water. After filtration the extracts are diluted with water containing 

20 20 mg/mL sodium acide. The separation and detection of the amino acids is performed 
using an anion exchange column and an electrochemical detector. Technical details 
can be taken from Y. Ding et al., 2002, Direct determination of free amino acids and 
sugars in green tea by anion-exchange chromatography with integrated pulsed 
amperometric detection, J Chromatogr A, (2002) 982; 237-244, or e.g. from Karchi et 

25 al., 1993 , Plant J. 3: 721-727; Matthews M J, 1997 (Lysine, threonine and methionine 
biosynthesis. In BK Singh, ed, Plant Amino Acids: Biochemistry and Biotechnology. 
Dekker, New York, pp 205-225; H Hesse and R Hoefgen. (2003) Molecular aspects of 
methionine biosynthesis. TIPS 8(259-262. 

[01 15.0.0.0] In a preferred embodiment, the present invention relates to a process for 
30 the production of the fine chemical comprising or generating in an organism or a part 
thereof the expression of at least one nucleic acid molecule comprising a nucleic acid 
molecule selected from the group consisting of: 

nucleic acid molecule encoding, preferably at least the mature form, of the 
polypeptide as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 
76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 
146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 
178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 



a) 

35 
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210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230. 232, 234, 236, 238, 240, 
242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 
274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 
306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 
5 338, 340, 342, 344. 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 

370, 372, 374, 376, 378, 380. 382, 384, 386, 388, 390, 392 or 394 or a fragment 
thereof, which confers an increase in the amount of the fine chemical in an 
organism or a part thereof; 

b) nucleic acid molecule comprising, preferably at least the mature form, of the 

10 nucleic acid molecule as depicted in SEQ ID NO: 1 , 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 

21 , 23, 25, 27, 29, 31 , 33, 35, 37, 39, 41 43. 45, 55, 57, 59, 61 , 63, 65, 67, 69, 
71, 73, 75, 77, 79, 81, 83, 85, 87. 89. 91. 93, 95, 97, 99, 101, 103, 105, 107, 109, 
111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145, 147, 149, 151, 153, 155, 157, 159, 161. 163. 165, 167, 169, 171, 173, 

15 175. 177. 179. 181. 183. 185. 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 

207, 209, 211. 213, 215, 217, 219. 221, 223, 225, 227, 229, 231, 233, 235, 237, 
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 
271, 273, 275. 277, 279, 281, 283, 285. 287. 289. 291. 293, 295, 297, 299, 301, 
303, 305, 307, 309, 311, 313, 315, 317, 319, 321. 323, 325, 327, 329, 331, 333, 

20 335, 337, 339, 341 , 343, 345, 347, 349, 351 , 353, 355, 357, 359, 361, 363, 365, 

367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391 or 393; 

c) nucleic acid molecule whose sequence can be deduced from a polypeptide 
sequence encoded by a nucleic acid molecule of (a) or (b) as result of the 
degeneracy of the genetic code and conferring an increase in the amount of the 

25 fine chemical in an organism or a part thereof; 

d) nucleic acid molecule encoding a polypeptide which has at least 50% identity 
with the amino acid sequence of the polypeptide encoded by. the nucleic acid 
molecule of (a) to (c) and conferring an increase in the amount of the fine 
chemical in an organism or a part thereof; 

30 e) nucleic acid molecule which hybidizes with a nucleic acid molecule of (a) to (c) 
under under stringent hybridisation conditions and conferring an increase in the 
amount of the fine chemical in an organism or a part thereof; 

f) nucleic acid molecule encoding a polypeptide, the polypeptide being derived by 
substituting, deleting and/or adding one or more amino acids of the amino acid 
35 sequence of the polypeptide encoded by the nucleic acid molecules (a) to (d), 

preferably to (a) to (c) and conferring an increase in the amount of the fine 
chemical in an organism or a part thereof; 
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g) nucleic acid molecule encoding a fragment or an epitope of a polypeptide which 
is encoded by one of the nucleic acid molecules of (a) to (e), preferably to (a) to 
(c) and conferring an increase in the amount of the fine chemical in an organism 
or a part thereof; 

5 h) nucleic acid molecule comprising a nucleic acid molecule which is obtained by 
amplifying nucleic acid molecules from a cDNA library or a genomic library using 
the primers as depicted in SEQ ID NO: 53 or SEQ ID NO: 54 and conferring an 
increase in the amount of the fine chemical in an organism or a part thereof; 

i) nucleic acid molecule encoding a polypeptide which is isolated, e.g. from an 
10 expression library, with the aid of monoclonal antibodies against a polypeptide 

encoded by one of the nucleic acid molecules of (a) to (h), preferably to (a) to (c), 
and and conferring an increase in the amount of the fine chemical in an organism 
or a part thereof; 

j) nucleic acid molecule which encodes a polypeptide comprising the consensus 
15 sequence as depicted in SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ 

ID NO: 50, SEp ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID NO: 398, 
SEQ ID NO: 399 and/or SEQ ID NO: 400 and conferring an increase in the 
amount of the fine chemical in an organism or a part thereof ; and/or 

k) nucleic acid molecule which is obtainable by screening a suitable library under 
20 stringent conditions with a probe comprising one of the sequences of the nucleic 

acid molecule of (a) to fl), preferably to (a) to (c), or with a fragment of at least 15 
nt, preferably 20 nt, 30 nt, 50 nt, 1 00 nt, 200 nt or 500 nt of the nucleic acid 
molecule characterized in (a) to (j), preferably to (a) to (c), and conferring an 
increase in the amount of the fine chemical in an organism or a part thereof; 

25 or which comprises a sequence which is complementary thereto. 

[0116.0.0.0] In one embodiment, the nucleic acid molecule used in the process 
distinguishes over the sequence as depicted in SEQ ID NO: 1, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137,139, 141, 

30 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 
211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 
245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 

35 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 
347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 
381, 383, 385, 387, 389, 391 or 393 by one or more nucleotides or does not consist of 
the sequence as depicted in SEQ ID NO: 1, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
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77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 
117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 
151* 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 
18s', 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 
5 219*. 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 

. ««« ««r- nr*-r nen 1-7.4 «V70 07R 077 OTQ 0$M Oft*? 

253, 2i5t>, Zi3(, Z0«, ZO I, ^oo, ^oo, ^o/ , ^oa, t-i v., ^< ~, ., ■ ■ 

287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 31 1, 313, 315, 317, 319, 
321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351. 353, 
355, 357, 359, 361, 363, 365, 367, 369, 371, 373. 375, 377, 379, 381, 383, 385, 387, 
10 389, 391or 393. In one embodiment, the nucleic acid molecule of the present invention 
is iess than 1 00%, 99,999%, 99,99%, 99,9% or 99% identical to the sequence as 
depicted in SEQ ID NO: 1, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 
87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115. 117. 119. 121. 123. 
125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 
15 159. 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189. 191, 
193, 195, 197, 199, 201. 203. 205. 207. 209, 211, 213, 215, 217. 219, 221, 223, 225, 
227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 
261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283. 285, 287, 289, 291. 293. 
295. 297. 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 
20 329, 331 , 333, 335, 337, 339, 341 , 343, 345, 347, 349, 351 , 353, 355, 357, 359. 361 . 
363, 365. 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391or 393. In 
another embodiment, the nucleic acid molecule does not encode a polypeptide of the 
sequence as depicted in SEQ ID NO: 2, 56, 58, 60. 62. 64. 66. 68. 70, 72, 74. 76. 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98.100, 102, 104, 106, 108, 110, 112. 114. 116, 118, 
25 120, 122. 124. 126, 128, 130, 132, 134. 136. 138, 140, 142. 144. 146, 148. 150, 152, 
154, 156, 158, 160, 162, 164. 166. 168. 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198. 200. 202. 204. 206. 208, 210, 212, 214, 216, 218, 220, 
222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248. 250, 252, 254, 
256', 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 
30 290. 292, 294, 296, 298, 300. 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 
324, 326, 328, 330, 332. 334. 336. 338. 340, 342. 344, 346, 348. 350, 352, 354, 356, 
358, 360. 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 
392 or 394. In another embodiment, the nucleic acid molecule does not encode a 
polypeptide of the sequence of the present invention is less than 100%, 99,999%, 
35 99,99%, 99,9% or 99% identical to the sequence as depicted in SEQ ID NO: 2, 56, 58, 
60. 62, 64, 66, 68. 70. 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116. 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 
138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162. 164, 166, 168, 170, 
172, 174. 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202. 204, 
40 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 
240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 
274, 276. 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 
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308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 
342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 
376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. In another embodiment, the 
nucleic acid molecule encodes a polypeptide of the sequence as depicted in SEQ ID 
5 NO: 2, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 
98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120. 122. 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 
166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 
200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 

10 234, 236, 238, 240, 242, 244, 246, 248. 250. 252, 254, 256, 258, 260. 262. 264. 266. 
268", 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 
302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 
336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 
370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 and distinguishes 

15 over said sequence by one or more amino acids preferably by 1, 2, 3, 4, 5, 6, 7, 8, 9 or 
10 amino acids. In further another advantageously embodiment, the nucleic acid 
molecule used in the process distinguishes over the sequence as depicted in SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, 
SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, 

20 SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, 
SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41 SEQ ID NO: 43 or 
SEQ ID NO: 45 by one or more nucleotides or does not consist of the sequence as 
depicted in SEQ ID NO: 3, SEQ ID NO: 5. SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 
11. SEQ ID NO: 13. SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, 

25 SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29. SEQ ID NO: 31 , 
SEQ ID NO: 33, SEQ ID NO: 35. SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41 
SEQ ID NO: 43 or SEQ ID NO: 45. In one advantageously embodiment, the nucleic 
acid molecule of the present invention is less than 100%, 99,999%, 99,99%. 99,9% or 
99% identical to the sequence as depicted in SEQ ID NO: 3. SEQ ID NO: 5, SEQ ID 

30 NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 
17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, 
SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 35, SEQ ID NO: 37, 
SEQ ID NO: 39, SEQ ID NO: 41 SEQ ID NO: 43 or SEQ ID NO: 45. In another 
advantageously embodiment, the nucleic acid molecule does not encode a polypeptide 

35 of the sequence as depicted in SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID 
NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID 
NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID 
NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID 
NO: 40, SEQ ID NO: 42, SEQ ID NO: 44 or SEQ ID NO: 46. 



40 [01 17.0.0.0] Unless otherwise specified, the terms "polynucleotides", "nucleic acid" 
and "nucleic acid molecule" are interchangeably in the present context. Unless 
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otherwise specified, the terms "peptide", "polypeptide" and "protein" are 
interchangeably in the present context. The term "sequence" may relate to 
polynucleotides, nucleic acids, nucleic acid molecules, peptides, polypeptides and 
proteins, depending on the context in which the term "sequence" is used. The terms 
5 "gene(s)", "polynucleotide", "nucleic acid sequence", "nucleotide sequence", or "nucleic 

OMIU I I IWIOOUIO^O J CJC5 UOCU 11^1^7111 I CICIO IU CI JJWI^IIICI I Willi VI nwuiwrwuuw Wi Ul IJT 

length, either ribonucleotides or deoxyribonucleotides. The terms refer only to the 
primary structure of the molecule. 

[01 18.0.0.0] Thus, the terms "gene(s)", "polynucleotide", "nucleic acid sequence", 
0 "nucleotide sequence", or "nucleic acid molecule(s)" as used herein include double- 
and single-stranded DNA and RNA. They also include known types of modifications, for 
example, methylation, "caps", substitutions of one or more of the naturally occurring 
nucleotides with an analog. Preferably, the DNA or RNA sequence of the invention 
comprises a coding sequence encoding the herein defined polypeptide. 

5 [0119.0.0.0] A "coding sequence" is a nucleotide sequence, which is transcribed into 
mRNA and/or translated into a polypeptide when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 
determined by a translation start codon at the 5-terminus and a translation stop codon 
at the 3'-terminus, A coding sequence can include, but is not limited to mRNA, cDNA, 

0 recombinant nucleotide sequences or genomic DNA, while introns may be present as 
well under certain circumstances. 

[0120.0.0.0] Nucleic acid molecules with the sequence as depicted in SEQ ID NO: 1, 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 1 1 , SEQ ID 
NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID 

5 NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID 
NO: 33, SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41 SEQ ID NO: 
43 or SEQ ID NO: 45, nucleic acid molecules which are derived from the amino acid 
sequences as depicted in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, 
SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, 

0 SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, 
SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, 
SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44 or SEQ ID NO: 46 or from 
polypeptides comprising the consensus sequence as depicted in SEQ ID NO: 47, SEQ 
ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID 

;5 NO: 397, SEQ ID NO: 398, SEQ ID NO: 399 and/or SEQ ID NO: 400, or their 
derivatives or homologues encoding polypeptides with the enzymatic or biological 
activity of a protein as depicted in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 
ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID 
NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID 

0 NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID 
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NO: 38, SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44 or SEQ ID NO: 46 and/or 
conferring a fine chemical increase after increasing its expression or activity are 
advantageously increased in the process according to the invention. 

[0121.0.0.0] In one embodiment, said sequences are cloned into nucleic acid 
5 constructs, either individually or in combination. These nucleic acid constructs enable 
an optimal synthesis of the fine chemical produced in the process according to the 
invention. 

[0122.0.0.0] Nucleic acid molecules, which are advantageous for the process 
according to the invention and which encode polypeptides with the biological activity of 
10 the protein of the invention can be determined from generally accessible databases. 

[0123.0.0.0] Those, which must be mentioned, in particular jn this context are 
general gene databases such as the EMBL database (Stoesser G. et al., Nucleic Acids 
Res 2001, Vol. 29, 17-21), the GenBank database (Benson D.A. et al., Nucleic Acids 
Res 2000, Vol. 28,15-18), or the PIR database (Barker W. C. et al., Nucleic Acids Res. 

15 1 999, Vol. 27, 39-43). It is furthermore possible to use organism-specific gene 

databases for determining advantageous sequences, in the case of yeast for example 
advantageously the SGD database (Cherry J. M. et al., Nucleic Acids Res. 1998, Vol. 
26, 73-80) or the MIPS database (Mewes H.W. et al., Nucleic Acids Res. 1999, Vol. 27, 
44-48), in the case of E. coli the GenProtEC database 

20 (http://web.bham.ac.uk/bcm4ght6/res.html), and in the case of Arabidopsis the TAIR- 
database (Huala, E. et al., Nucleic Acids Res. 2001 Vol. 29(1), 102-5) or the MIPS 
database. 

[0124.0.0.0] The nucleic acid molecules used in the process according to the 
invention take the form of isolated nucleic acid sequences, which encode polypeptides 

25 having the biological activity represented by a protein as depicted in SEQ ID NO: 2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28. 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 
62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 

30 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 
208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 
276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306. 308, 
310, 312, 314. 316, 318. 320, 322, 324, 326, 328, 330. 332, 334, 336, 338, 340, 342, 

35 344. 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 
378, 380, 382, 384, 386, 388, 390, 392 or 394and conferring the fine chemical 
increase. 
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[0125.0.0.0] The nucleic acid sequence(s) used in the process for the production of 
the fine chemical in transgenic organisms originate advantageously from an eukaryote 
but may also originate from a prokaryote or an archebacterium, thus it can derived from 
e.g. a microorganism, an animal or a plant. 

5 [0126.0.0.0] For the purposes of the invention, as a rule the plural is intended to 
encompass the singular and vice versa. 

[0127.0.0.0] In order to improve the introduction of the nucleic acid sequences and 
the expression of the sequences in the transgenic organisms, which are used in the 
process, the nucleic acid sequences are incorporated into a nucleic acid construct 

1 0 and/br a vector. In addition to the herein described sequences which are used in the 
process according to the invention, further nucleic acid sequences, advantageously of 
biosynthesis genes of amino acids, carbohydrates, lipids, fatty acids, vitamins etc. 
produced in the process according to the invention, may additionally be present in the 
nucleic acid construct or in the vector and may be introduced into the organism 

15 together. However, these additional sequences may also be introduced into the 
organisms via other, separate nucleic acid constructs or vectors. 

[0128.0.0.0] Using the herein mentioned cloning vectors and transformation methods 

such as those which are published and cited in: Plant Molecular Biology 

and Biotechnology (CRC Press, Boca Raton, Florida), chapter 6/7, pp. 71-119 (1993); 

20 F.F. White, Vectors for Gene Transfer in Higher Plants; in: Transgenic Plants, vol. 1 , 
Engineering and Utilization, Ed.: Kung and R. Wu, Academic Press, 1993, 15-38; 
B. Jenes et al„ Techniques for Gene Transfer, in: Transgenic Plants, vol. 1, 
Engineering and Utilization, Ed.: Kung and R. Wu, Academic Press (1993), 128-143; 
Potrykus, Annu. Rev. Plant. Physiol. Plant Molec. Biol. 42 (1991), 205-225)) and further 

25 cited below, the nucleic acids may be used for the recombinant modification of a wide 
range of organisms, in particular prokaryotic or eukaryotic microorganisms or plants, so 
that they become a better and more efficient producer of the fine chemical produced in 
the process according to the invention. This improved production, or production 
efficiency, of the fine chemical or products derived there from, such as modified 

30 proteins, can be brought about by a direct effect of the manipulation or by an indirect 
effect of this manipulation. 

[0129.0.0.0] In one embodiment, the nucleic acid molecule according to the invention 
originates from a plant, such as a plant selected from the families Aceraceae, 
Anacardiaceae, Apiaceae, Asteraceae, Brassicaceae, Cactaceae, Cucurbitaceae, 
35 Euphorbiaceae, Fabaceae, Malvaceae, Nymphaeaceae, Papaveraceae, Rosaceae, 
Salicaceae, Solanaceae, Arecaceae, Bromeliaceae, Cyperaceae, Iridaceae, Liliaceae, 
Orchidaceae, Gentianaceae, Labiaceae, Magnoliaceae, Ranunculaceae, Carifolaceae, 
Rubiaceae, Scrophulariaceae, Caryophyllaceae, Ericaceae, Polygonaceae, Violaceae, 
Juncaceae or Poaceae and preferably from a plant selected from the group of the 
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families Apiaceae, Asteraceae, Brassicaceae, Cucurbitaceae, Fabaceae, 
Papaveraceae, Rosaceae, Solanaceae, Liiiaceae or Poaceae. Preferred are crop 
plants and in particular plants mentioned herein above as host plants such as the 
families and genera mentioned above for example preferred the species Anacardium 
5 occidentals, Calendula officinalis, Carthamus tinctorius, Cichorium intybus, Cynara 
scolymus, Helianthus annus, Tagetes lucida, Tagetes erecta, Tagetes tenuifolia; 
Daucus carota\ Corylus avellana, Corylus columa, Borago officinalis] Brassica napus, 
Brassica rapa ssp., Sinapis arvensis Brassica juncea, Brassica juncea var. juncea, 
Brassica juncea var. crispifolia, Brassica juncea var. foliosa, Brassica nigra, Brassica 

10 sinapioides, Melanosinapis communis, Brassica oleracea, Arabidopsis thaliana, Anana 
comosus, Ananas ananas, Brome/ia comosa, Carica papaya, Cannabis sative, 
Ipomoea batatus, Ipomoea pandurata, Convolvulus batatas, Convolvulus tiliaceus, 
Ipomoea fastigiata, Ipomoea tiliacea, Ipomoea triloba, Convolvulus panduratus, Beta 
vulgaris, Beta vulgaris var. altissima, Beta vulgaris var vulgaris, Beta maritima, Beta 

15 vulgaris var. perennis, Beta vulgaris var. conditiva, Beta vulgaris var. esculenta, 
Cucurbita maxima, Cucurbita mixta, Cucurbita pepo, Cucurbita moschata, Olea 
europaea, Manihot utilissima, Janipha manihot,, Jatropha manihot, Manihot aipil, 
Manihot dulcis, Manihot manihot, Manihot melanobasis, Manihot esculenta, Ricinus 
communis, Pisum sativum, Pisum arvense, Pisum humile, Medicago sativa, Medicago 

20 falcata, Medicago varia, Glycine max Dolichos soja, Glycine gracilis, Glycine hispida, 
Phaseolus max, Soja hispida, Soja max, Cocos nucifera, Pelargonium grossularioides, 
Oleum cocoas, Laurus nobilis, Persea americana, Arachis hypogaea, Linum 
usitatissimum, Linum humile, Linum austriacum, Linum bienne, Linum angustifolium, 
Linum catharticum, Linum flavum, Linum grandiflorum, Adenolinum grandiflorum, 

25 Linum lewisii, Linum narbonense, Linum perenne, Linum perenne var. lewisii, Linum 
pratense, Linum trigynum, Punica granatum, Gossypium hirsutum, Gossypium 
arboreum, Gossypium barbadense, Gossypium herbaceum, Gossypium thurberi, Musa 
nana, Musa acuminata, Musa paradisiaca, Musa spp., Elaeis guineensis, Papaver 
orientate, Papaver rhoeas, Papaver dubium, Sesamum indicum, Piper aduncum, Piper 

30 amalago, Piper angustifolium, Piper auritum, Piper betel, Piper cubeba, Piper longum, 
Piper nigrum, Piper retrofractum, Artanthe adunca, Artanthe elongata, Peperomia 
elongata, Piper elongatum, Steffensia elongata, , Hordeum vulgare, Hordeum jubatum, 
Hordeum murinum, Hordeum secalinum, Hordeum distichon Hordeum aegiceras, 
Hordeum hexastichon., Hordeum hexastichum, Hordeum irregulare, Hordeum sativum, 

35 Hordeum secalinum, Avena sativa, Avena fatua, Avena byzantina, Avena fatua var. 
sativa, Avena hybrida, Sorghum bicolor, Sorghum halepense, Sorghum saccharatum, 
Sorghum vulgare, Andropogon drummondii, Holcus bicolor, Holcus sorghum, Sorghum 
aethiopicum, Sorghum arundinaceum, Sorghum caffrorum, Sorghum cernuum, 
Sorghum dochna, Sorghum drummondii, Sorghum durra, Sorghum guineense, 

40 Sorghum lanceolatum, Sorghum nervosum, Sorghum saccharatum, Sorghum 
subglabrescens, Sorghum verticilliflorum, Sorghum vulgare, Holcus halepensis, 
Sorghum miliaceum millet, Panicum militaceum, Zea mays, Triticum aestivum, Triticum 
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durum, Triticum turgidum, Triticum hybernum, Triticum macha, Triticum sativum or 
Triticum vulgare, Cofea spp., Coffea arabica, Coffea canephora, Coffea liberies, 
Capsicum annuum, Capsicum annuum var. glabriusculum, Capsicum frutescens, 
Capsicum annuum, Nicotiana tabacum, Solanum tuberosum, Solanum melongena,. 
5 Lycopersicon esculentum, Lycopersicon lycopersicum., Lycopersicon pyriforme, 

Solanum integrifolium, Solanum iycopersicumTheobroma cacao or Camellia sinensis. 

[01 30.0.0.0] In one embodiment the nucleic acid molecule sequence originates 
advantageously from a microorganism as mentioned above under host organism such 
as a fungus for example the genera Aspergillus, Penicillium or Claviceps or from yeasts 
10 such as the genera Pichia, Torulopsis, Hansenula, Schizosaccharomyces, Candida, 
Rhodotorula or Saccharomyces, very especially advantageously from the yeast of the 
family Saccharomycetaceae, such as the advantageous genus Saccharomyces and 
the very advantageous genus and species Saccharomyces cerevisiae for the 
production of the fine chemical in microorganims. 

15 [0131 .0.0.0] The skilled worker knows other suitable sources for the production of 
fine chemicals, which present also useful nucleic acid molecule sources. They include 
in general all prokaryotic or eukaryotic cells, preferably unicellular microorganisms, 
such as fungi like the genus Claviceps or Aspergillus or gram-positive bacteria such as 
the genera Bacillus, Corynebacterium, Micrococcus, Brevibacterium, Rhodococcus, 

20 Nocardia, Caseobacter or Arthrobacter or gram-negative bacteria such as the genera 
Escherichia, Flavobacterium or Salmonella, or yeasts such as the genera Rhodotorula, 
Hansenula or Candida. 

[01 32.0.0.0] Production strains which are especially advantageously selected in the 
process according to the invention are microorganisms selected from the group of the 

25 families Actinomycetaceae, Bacillaceae, Brevibacteriaceae, Corynebacteriaceae, 

Enterobacteriacae. Gordoniaceae, Micrococcaceae, Mycobacteriaceae. Nocardiaceae, 
Pseudomonaceae, Rhizobiaceae, Streptomycetaceae, Chaetomiaceae, 
Choanephoraceae, Cryptococcaceae, Cunninghamellaceae, Demetiaceae, 
Moniliaceae, Mortierellaceae, Mucoraceae, Pythiaceae, Saccharomycetaceae, 

30 Saprolegniaceae, Schizosaccharomycetaceae, Sodariaceae, Sporobolomycetaceae, 
Tuberculariaceae, Adelotheciaceae, Dinophyceae, Ditrichaceae and Prasinophyceaeor 
of the genera and species consisting of Hansenula anomala, Candida utilis, Claviceps 
purpurea, Bacillus circulans, Bacillus subtilis, Bacillus sp., Brevibacterium albidum, 
Brevibacterium album, Brevibacterium cerinum, Brevibacterium flavum, Brevibacterium 

35 glutamigenes, Brevibacterium iodinum, Brevibacterium ketoglutamicum, 

Brevibacterium lactofermentum, Brevibacterium linens, Brevibacterium roseum, 
Brevibacterium saccharolyticum, Brevibacterium sp., Corynebacterium 
acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium 
ammoniagenes, Corynebacterium glutamicum (= Micrococcus glutamicum), Coryne- 
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bacterium melassecola, Corynebacterium sp. or Escherichia coli, specifically 
Escherichia coli K12 and its described strains. 

[0133.0.0.0] However, it is also possible to use artificial sequences, which differ in 
one or more bases (= nucleotides) from the nucleic acid sequences found in 
5 organisms, or in one or more amino acid molecules from polypeptide sequences found 
in organisms, in particular from the polypeptide sequences as depicted in SEQ ID NO: 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 
58. 60, 62, 64, 66, 68, 70, 72, 74, 76. 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 
102, 104, 106, 108, 110, 112, 114, 116, 118, 120. 122, 124, 126, 128, 130, 132, 134, 

10 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 
170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200. 202, 
204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230. 232, 234. 236, 
238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 

15 306, 308, 310, 312, 314, 316, 318, 320, 322, 324. 326. 328, 330, 332, 334, 336, 338, 
340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 
374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 or the functional homologues 
thereof as described herein, preferably conferring above-mentioned biological activity, 
i.e. conferring the fine chemical increase after increasing Its activity, e.g. having the 

20 biological activity represented by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 
14. 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 
68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96. 98, 100, 102, 104, 106, 108, 
110, 112, 114, 116, 118, 120. 122, 124, 126, 128, 130. 132, 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158. 160, 162, 164, 166. 168, 170, 172, 174, 176, 

25 178, 180, 182, 184. 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 
212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238. 240, 242, 244, 
246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340. 342, 344, 346, 

30 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 
382, 384, 386, 388, 390, 392 or 394. 

[0134.0.0.0] In the process according to the invention nucleic acid sequences can be 
used, which, if appropriate, contain synthetic, non-natural or modified nucleotide bases, 
which can be incorporated into DNA or RNA. Said synthetic, non-natural or modified 
35 bases can for example increase the stability of the nucleic acid molecule outside or 
inside a cell. The nucleic acid molecules of the invention can contain the same 
modifications as aforementioned. 

[0135.0.0.0] As used in the present context the term "nucleic acid molecule" may 
also encompass the untranslated sequence located at the 3' and at the 5' end of the 
40 coding gene region, for example at least 500, preferably 200, especially preferably 1 00, 
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nucleotides of the sequence upstream of the 5' end of the coding region and at least 
100, preferably 50, especially preferably 20, nucleotides of the sequence downstream 
of the 3' end of the coding gene region. It is often advantageous only to choose the 
coding region for cloning and expression purposes. 

5 [Ql 36.0.0.0] Preferably, the nucleic acid molecule used in the process according to 
the invention or the nucleic acid molecule of the invention is an isolated nucleic acid 
molecule. 

[0137.0.0.0] An "isolated" polynucleotide or nucleic acid molecule is separated from 
other polynucleotides or nucleic acid molecules, which are present in the natural 

10 source of the nucleic acid molecule. An isolated nucleic acid molecule may be a 
chromosomal fragment of several kb, or preferably, a molecule only comprising the 
coding region of the gene. Accordingly, an isolated nucleic acid molecule of the 
invention may comprise chromosomal regions, which are adjacent 5' and 3' or further 
adjacent chromosomal regions, but preferably comprises no such sequences which 

1 5 naturally flank the nucleic acid molecule sequence in the genomic or chromosomal 
context in the organism from which the nucleic acid molecule originates (for example 
sequences which are adjacent to the regions encoding the 5% and 3-UTRs of the 
nucleic acid molecule). In various embodiments, the isolated nucleic acid molecule 
used in the process according to the invention may, for example comprise less than 

20 approximately 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0. 1 kb nucleotide sequences which 
naturally flank the nucleic acid molecule in the genomic DNA of the cell from which the 
nucleic acid molecule originates. 

[0138.0.0.0] The nucleic acid molecules used in the process, for example the 
polynucleotides of the invention or of a part thereof can be isolated using molecular- 

25 biological standard techniques and the sequence information provided herein. Also, for 
example a homologous sequence or homologous, conserved sequence regions at the 
DNA or amino acid level can be identified with the aid of comparison algorithms. The 
former can be used as hybridization probes under standard hybridization techniques 
(for example those described in Sambrook et al., Molecular Cloning: A Laboratory 

30 Manual. 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY. 1989) for isolating further nucleic acid sequences 
useful in this process. 

[01 39.0.0.0] A nucleic acid molecule encompassing a complete sequence of the 
nucleic acid molecules used in the process, for example the polynucleotide of the 
35 invention, or a part thereof may additionally be isolated by polymerase chain reaction, 
oligonucleotide primers based on this sequence or on parts thereof being used. For 
example, a nucleic acid molecule comprising the complete sequence or part thereof 
can be isolated by polymerase chain reaction using oligonucleotide primers which have 
been generated on the basis of this sequence for example, mRNA can be isolated from 
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cells (for example by means of the guanidinium thiocyanate extraction method of 
Chirgwin et al. (1979) Biochemistry 18:5294-5299) and cDNA can be generated by 
means of reverse transcriptase (for example Moloney MLV reverse transcriptase, 
available from Gibco/BRL, Bethesda, MD, or AMV reverse transcriptase, obtainable 
5 from Seikagaku America, Inc., St. Petersburg, FL). 

[0140.0.0.0] Synthetic oligonucleotide primers for the amplification, e.g. as shown in 
SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 395 or SEQ ID NO: 396, by means of 
polymerase chain reaction can be generated on the basis of a sequence shown herein, 
for example the sequence as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 

10 21, 23, 25, 27, 29, 31, 33. 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 
149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 
183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 

15 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243. 245, 247, 249, 
251, 253, 255. 257. 259, 261, 263. 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 
285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 
319, 321, 323. 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 
353, 355, 357, 359. 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 

20 387,389, 391 or 393. 

[0141.0.0.0] Moreover, it is possible to identify conserved regions from various 
organisms by carrying out protein sequence alignments with the polypeptide used in 
the process of the invention, in particular with sequences of the polypeptide of the 
invention, from which conserved regions, and in turn, degenerate primers can be 
25 derived. Conserved regions are those, which show a very little variation in the amino 
acid in one particular position of several homologs from different origin. The consenus 
sequences as depicted in SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID 
NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID 
NO: 399 and/or SEQ ID NO: 400 are derived from said aligments. 

30 [0142.0.0.0] Degenerated primers can then be utilized by PCR for the amplification of 
fragments of novel proteins having above-mentioned activity, e.g. conferring the 
increase of the fine chemical after increasing the expression or activity or having the 
biological activity represented by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 

35 68, 70, 72, 74, 76, 78, 80. 82. 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138. 140, 142. 
144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 
178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 
212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244. 

40 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270. 272, 274, 276, 278, 
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280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
314*, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 
382, 384, 386, 388, 390, 392 or 394or further functional homologs of the polypeptide of 
5 the invention from other organisms. 

[0143.0.0.0] These fragments can then be utilized as hybridization probe for isolating 
the complete gene sequence. As an alternative, the missing 5* and 3' sequences can 
be isolated by means of RACE-PCR (rapid amplification of cDNA ends). A nucleic acid 
molecule according to the invention can be amplified using cDNA or, as an alternative, 

10 genomic DNA as template and suitable oligonucleotide primers, following standard 

PCR amplification techniques. The nucleic acid molecule amplified thus can be cloned 
into a suitable vector and characterized by means of DNA sequence analysis. 
Oligonucleotides, which correspond to one of the nucleic acid molecules used in the 
process can be generated by standard synthesis methods, for example using an 

1 5 automatic DNA synthesizer. 

[0144.0.0.0] Nucleic acid molecules which are advantageously for the process 
according to the invention can be isolated based on their homology to the nucleic acid 
molecules disclosed herein using the sequences or part thereof as hybridization probe 
and following standard hybridization techniques under stringent hybridization 

20 conditions. In this context, it is possible to use, for example, isolated nucleic acid 

molecules of at least 15, 20, 25, 30, 35, 40, 50, 60 or more nucleotides, preferably of at 
least 15, 20 or 25 nucleotides in length which hybridize under stringent conditions with 
the above-described nucleic acid molecules, in particular with those which encompass 
a nucleotide sequence of the nucleic acid molecule used in the process of the invention 

25 or encoding a protein used in the invention or of the nucleic acid molecule of the 

invention. Nucleic acid molecules with 30, 50, 100, 250 or more nucleotides may also 
be used. 

[0145.0.0.0] The term "homology" means that the respective nucleic acid molecules 
or encoded proteins are functionally and/or structurally equivalent. The nucleic acid 

30 molecules that are homologous to the nucleic acid molecules described above and that 
are derivatives of said nucleic acid molecules are, for example, variations of said 
nucleic acid molecules which represent modifications having the same biological 
function, in particular encoding proteins with the same or substantially the same 
biological function. They may be naturally occurring variations, such as sequences 

35 from other plant varieties or species, or mutations. These mutations may occur 

naturally or may be obtained by mutagenesis techniques. The allelic variations may be 
naturally occurring allelic variants as well as synthetically produced or genetically 
engineered variants. Structurally equivalents can, for example, be identified by testing 
the binding of said polypeptide to antibodies or computer based predictions. 
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Structurally equivalent have the similar immunological characteristic, e.g. comprise 
similar epitopes. 

[0146.0.0.0] By "hybridizing" it is meant that such nucleic acid molecules hybridize 
under conventional hybridization conditions, preferably under stringent conditions such 
5 as described by, e.g., Sambrook (Molecular Cloning; A Laboratory Manual, 2nd Edition, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989)) or in Current 
Protocols in Molecular Biology, John Wiley & Sons, N. Y. (1989), 6.3.1-6.3.6. 

[0147.0.0.0] According to the invention, DNA as well as RNA molecules of the 
nucleic acid of the invention can be used as probes. Further, as template for the 

10 identification of functional homologues Northern blot assays as well as Southern blot 
assays can be performed. The Northern blot assay advantageously provides further 
informations about the expressed gene product: e.g. expression pattern, occurance of 
. processing steps, like splicing and capping, etc. The Southern blot assay provides 
additional information about the chromosomal localization and organization of the gene 

1 5 encoding the nucleic acid molecule of the invention. 

[0148.0.0.0] A preferred, nonlimiting example of stringent hydridization conditions are 
hybridizations in 6 x sodium chloride/sodium citrate (= SSC) at approximately 45°C, 
followed by one or more wash steps in 0.2 x SSC, 0.1 % SDS at 50 to 65°C, for 
example at 50°C, 55°C or 60°C. The skilled worker knows that these hybridization 

20 conditions differ as a function of the type of the nucleic acid and, for example when 
organic solvents are present, with regard to the temperature and concentration of the 
buffer. The temperature under "standard hybridization conditions" differs for example 
as a function of the type of the nucleic acid between 42°C and 58°C, preferably 
between 45°C and 50°C in an gqueous buffer with a concentration of 0.1 x 0.5 x, 1 x, 

25 2x, 3x, 4x or 5 x SSC (pH 7.2). If organic solvent(s) is/are present in the 

abovementioned buffer, for example 50% formamide, the temperature under standard 
conditions is approximately 40°C, 42°C or 45°C. The hybridization conditions for 
DNA:DNA hybrids are preferably for example 0.1 x SSC and 20°C, 25°C, 30°C, 35°C, 
40°C or45°C, preferably between 30°C and 45°C. The hybridization conditions for 

30 DNA: RNA hybrids are preferably for example 0.1 x SSC and 30°C, 35°C, 40°C, 45°C, 
50°C or 55°C, preferably between 45°C and 55°C. The abovementioned hybjidization 
temperatures are determined for example for a nucleic acid approximately 100 bp (= 
base pairs) in length and a G + C content of 50% in the absence of formamide. The 
skilled worker knows to determine the hybridization conditions required with the aid of 

35 textbooks, for example the ones mentioned above, or from the following textbooks: 
Sambrook et al., "Molecular Cloning", Cold Spring Harbor Laboratory, 1989; Hames 
and Higgins (Ed.) 1985, "Nucleic Acids Hybridization: A Practical Approach", IRL Press 
at Oxford University Press, Oxford; Brown (Ed.) 1991, "Essential Molecular Biology: A 
Practical Approach", IRL Press at Oxford University Press, Oxford. 
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[0149.0.0.0] A further example of one such stringent hybridization condition is 
hybridization at 4XSSC at 65°C, followed by a washing in 0.1XSSC at 65°C for one 
hour. Alternatively, an exemplary stringent hybridization condition is in 50 % 
formamide, 4XSSC at42°C. Further, the conditions during the wash step can be 
5 selected from the range of conditions delimited by low-stringency conditions 

/. j . /-iv/ r»«N^ _* cnom Kinti t>Wir>nanr-\i onnHitinnft ( s nnrnYimatelv 0.2X 

(approximately oou m ju \sj emu 1 1131™ . » 

SSC at 50°C, preferably at 65°C) (20X SSC: 0.3M sodium citrate, 3M NaCI, pH 7.0). In 

addition, the temperature during the wash step can be raised from low-stringency 

conditions at room temperature, approximately 22°C, to higher-stringency conditions at 

10 approximately 65°C. Both of the parameters salt concentration and temperature can be 
varied simultaneously, or else one of the two parameters can be kept constant while 
only the other is varied. Denaturants, for example formamide or SDS, may also be 
employed during the hybridization. In the presence of 50% formamide, hybridization is 
preferably effected at 42°C. Relevant factors like i) length of treatment, ii) salt 

15 conditions, Hi) detergent conditions, iv) competitor DNAs, v) temperature and vi) probe 
selection can be combined case by case so that not all possibilities can be mentioned 
herein. 

Thus, in a preferred embodiment, Northern blots are prehybridized with Rothi-Hybri- 
Quick buffer (Roth, Karlsruhe) at 68°C for 2h. Hybridzation with radioactive labelled 
20 probe is done overnight at 68°C. Subsequent washing steps are performed at 68°C 
with 1xSSC. 

For Southern blot assays the membrane is prehybridized with Rothi-Hybri-Quick buffer 
(Roth, Karlsruhe) at 68°C for 2h. The hybridzation with radioactive labelled probe is 
conducted over night at 68°C. Subsequently the hybridization buffer is discarded and 
25 the filter shortly washed using 2xSSC; 0,1% SDS. After discarding the washing buffer 
new 2xSSC; 0,1 % SDS buffer is added and incubated at 68°C for 1 5 minutes. This 
washing step is performed twice followed by an additional washing step using 1xSSC; 
0,1% SDS at 68°C for 10 min. 

[01 50.0.0.0] Some further examples of conditions for DNA hybridization (Southern 
30 blot assays) and wash step are shown hereinbelow: 

(1 ) Hybridization conditions can be selected, for example, from the following 
conditions: 

a) 4X SSC at 65°C. 

b) 6XSSCat45°C, 

35 c) 6X SSC, 100 mg/ml denatured fragmented fish sperm DNA at 68°C, 

d) 6X SSC, 0.5% SDS, 1 00 mg/ml denatured salmon sperm DNA at 68°C, 

e) 6X SSC, 0.5% SDS, 1 00 mg/ml denatured fragmented salmon sperm DNA, 50% 
formamide at 42°C, 

f) 50% formamide, 4X SSC at 42°C, 
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g) 50% (vol/vol) formamide, 0. 1 % bovine serum albumin, 0. 1 % Ficoll, 0. 1 % 
polyvinylpyrrolidone, 50 mM sodium phosphate buffer pH 6.5, 750 mM NaCI, 
75 mM sodium citrate at 42°C, 

h) 2X or 4X SSC at 50°C (low-stringency condition), or 

5 i) 30 to 40% formamide, 2X or 4X SSC at 42°C (low-stringency condition). 

(2) Wash steps can be selected, for example, from the following conditions: 

a) 0.015 M NaCI/0.0015 M sodium citrate/0.1% SDS at 50°C. 

b) 0.1XSSCat65°C. 

10 c) 0.1X SSC, 0.5 % SDS at 68°C. 

d) 0.1 X SSC, 0.5% SDS, 50% formamide at 42°C. 

e) 0.2X SSC, 0. 1 % SDS at 42°C. 

f) 2X SSC at 65°C (low-stringency condition). 

[0151.0.0.0] Polypeptides having above-mentioned biological activity, i.e. conferring 
15 the fine chemical increase, derived from other organisms, can be encoded by other 
DNA sequences which hybridize to the sequences shown in SEQ ID NO: 1, 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 
65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
20 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 
211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 
245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 
25 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 
347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 
381, 383, 385, 387, 389, 391 or 393under relaxed hybridization conditions and which 
code on expression for peptides having the fine chemical increasing activity. 

[0152.0.0.0] Further, some applications have to be performed at low stringency 
30 hybridisation conditions, without any consequences for the specificity of the 

hybridisation. For example, a Southern blot analysis of total DNA could be probed with 
a nucleic acid molecule of the present invention and washed at low stringency (55°C in 
2xSSPE0, 1% SDS). The hybridisation analysis could reveal a simple pattern of only 
genes encoding polypeptides of the present invention or used in the process of the 
35 invention, e.g. having herein-mentioned activity of increasing the fine chemical. A 
further example of such low-stringent hybridization conditions is 4XSSC at 50°C or 
hybridization with 30 to 40% formamide at 42°C. Such molecules comprise those which 
are fragments, analogues or derivatives of the polypeptide of the invention or used in 
the process of the invention and differ, for example, by way of amino acid and/or 
40 nucleotide deletion(s), insertion(s), substitution (s), addition(s) and/or recombination (s) 
or any other modification(s) known in the art either alone or in combination from the 
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above-described amino acid sequences or their underlying nucleotide sequence(s). 
However, it is preferred to use high stringency hybridisation conditions. 

[01 53.0.0.0] Hybridization should advantageously be carried out with fragments of at 
least 5, 10, 15, 20, 25, 30, 35 or 40 bp, advantageously at least 50, 60, 70 or 80 bp, 
5 preferably at least 90, 1 00 or 1 1 0 bp. Most preferably are fragments of at least 1 5, 20, 
25 or 30 bp. Preferably are also hybridizations with at least 100 bp or 200, very 
especially preferably at least 400 bp in length. In an especially preferred embodiment, 
the hybridization should be carried out with the entire nucleic acid sequence with 
conditions described above. 

10 [01 54.0.0.0] The terms "fragment", "fragment of a sequence" or "part of a sequence" 
mean a truncated sequence of the original sequence referred to. The truncated 
sequence (nucleic acid or protein sequence) can vary widely in length; the minimum 
size being a sequence of sufficient size to provide a sequence with at least a 
comparable function and/or biological activity of the original sequence referred to or 

1 5 hybidizing with the nucleic acid molecule of the invention or used in the process of the 
invention under stringend conditions, while the maximum size is not critical. In some 
applications, the maximum size usually is not substantially greater than that required to 
provide the desired activity and/or function(s) of the original sequence. 

[01 55.0.0.0] Typically, the truncated amino acid sequence will range from about 5 to 
20 about 260 amino acids in length. More typically, however, the sequence will be a 

maximum of about 220 amino acids in length, preferably a maximum of about 215 or 
100 amino acids. It is usually desirable to select sequences of at least about 100, 120 
or 150 amino acids, up to a maximum of about 200 or 250 amino acids. 

[01 56.0.0.0] The term "epitope" relates to specific immunoreactive sites within an 
25 antigen, also known as antigenic determinates. These epitopes can be a linear array of 
monomers in a polymeric composition - such as amino acids in a protein - or consist 
of or comprise a more complex secondary or tertiary structure. Those of skill will 
recognize that immunogens (i.e., substances capable of eliciting an immune response) 
are antigens; however, some antigen, such as haptens, are not immunogens but may 
30 be made immunogenic by coupling to a carrier molecule. The term "antigen" includes 
references to a substance to which an antibody can be generated and/or to which the 
antibody is specifically immunoreactive. 

[01 57.0.0.0] In one embodiment the present invention relates to a epitope of the 
polypeptide of the present invention or used in the process of the present invention and 
35 conferring above mentioned biological activity, preferably conferring an increase in the 
fine chemical. 

[01 58.0.0.0] The term "one or several amino acids" relates to at least one amino acid 
but not more than that number of amino acids, which would result in a homology of 
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below 50% identity. Preferably, the identity is more than 70% or 80%, more preferred 
are 85%, 90%, 91%, 92%, 93%, 94% or 95%, even more preferred are 96%, 97%, 
98%, or 99% identity. 

[0159.0.0.0] Further, the nucleic acid molecule of the invention comprises a nucleic 
5 acid molecule, which is a complement of one of the nucleotide sequences of above 
mentioned nucleic acid molecules or a portion thereof. A nucleic acid molecule which is 
complementary to one of the nucleotide sequences as depicted in SEQ ID NO: 1, 3, 5, 
7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 
63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 

10 107, 109, 111, 113,. 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137.139, 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 
175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 
209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 
243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 

15 277, 279, 281 , 283, 285, 287, 289, 291 , 293, 295, 297, 299, 301, 303, 305, 307, 309, 
311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 
345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 
379, 381, 383, 385, 367, 389, 391or 393 is one which is sufficiently complementary to 
one of the nucleotide sequences as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 

20 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 
71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 
181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 

25 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 
283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 
317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 

30 385, 387, 389, 39 lor 393 such that it can hybridize to one of the nucleotide sequences 
as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41 43, 45, 55, 57, 59, 61 , 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 
89, 91, 93, 95, 97,99. 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 

35 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 
195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 
229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 
263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 
297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 

40 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351. 353, 355, 357, 359, 361, 363, 
365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391or 393, thereby 
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forming a stable duplex. Preferably, the hybridisation is performed under stringent 
hybrization conditions. However, a complement of one of the herein disclosed 
sequences is preferably a sequence complement thereto according to the base pairing 
of nucleic acid molecules well known to the skilled person. For example, the bases A 
5 and G undergo base pairing with the bases T and U or C, resp. and visa versa. 

k A~**i{i~*4-\**r+r+ Aftu A hoenc r*on infii lonro the ha<5A-nairina ngrtner. 

[0160.0.0.0] The nucleic acid molecule of the invention comprises a nucleotide 
sequence which is at least about 30%, 35%, 40% or 45%, preferably at least about 
50%, 55%, 60% or 65%, more preferably at least about 70%, 80%, or 90%, and even 

10 more preferably at least about 95%, 97%, 98%, 99% most preferably at least about - 
99,1%; 99,2%, 99,3%; 99,4%; 99,5%; 99,6%; 99,7%; 99,8% or 99,9% or more homo- 
logous to a nucleotide sequence as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13, 15, 
17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 
71, 73, 75, 77, 79, 81. 83, 85, 87, 89, 91, 93, 95, 97, 99, 101. 103, 105, 107, 109. 111. 

15 113. 115. 117. 119. 121. 123. 125. 127, 129,131, 133, 135, 137, 139, 141, 143, 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169. 171. 173, 175, 177, 179, 
181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 
215. 217, 219, 22\ 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255. 257, 259. 261. 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 

20 283, 285, 287, 289, 291 , 293, 295, 297, 299, 301 , 303, 305, 307, 309, 311,313,315, 
317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371. 373. 375, 377, 379, 381, 383, 
385, 387, 389, 391or 393, or a portion thereof and preferably has above mentioned 
biological activity, in particular having a fine chemical increasing activity after 

25 increasing the biological acitivity of a protein having the biological activity represented 
by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28. 
30. 32. 34. 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76. 78. 80, 82, 
84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 
122, 124, 126, 128, 130. 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 

30 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206. 208, 210, 212, 214, 216, 218, 220, 222, 
224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256. 
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 
292, 294, 296! 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 
35 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 
360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 
394 and their respective gene products. 

[01 61 .0.0.0] The nucleic acid molecule of the invention comprises a nucleotide 
sequence which hybridizes, preferably hybridizes under stringent conditions as defined 
40 herein, to one of the nucleotide sequences as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 1 1 , 
13, 15. 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57. 59. 61, 63, 65, 
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67, 69, 71, 73, 75, 77, 79, 81. 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145, 147, 149, 151, 153, 155. 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205. 207, 209, 
5 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 
245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307. 309, 311, 
313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 
347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 
10 381 , 383, 385, 387, 389, 391 or 393, or a portion thereof and encodes a protein having 
' above-mentioned biological activity, e.g. conferring the fine chemical increase, and 
optionally, having the biological activity of YNL090W. 

[0162.0.0.0] Moreover, the nucleic acid molecule of the invention can comprise 
only a portion of the coding region of one of the sequences in SEQ ID NO: 1, 3, 5, 7, 9, 

15 11,13, 15, 17, 19, 21. 23, 25. 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 
65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123. 125. 127. 129. 131, 133, 135, 137, 139, 141, 
143, 145, 147. 149. 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171. 173, 175, 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 

20 211, 213, 215, 217, 219, 221, 223, 225, 227, 229; 231, 233, 235, 237, 239, 241, 243, 
245, 247, 249, 251, 253, 255, 257. 259, 261, 263, 265, 267, 269, 271, 273, 275, 277. 
279, 281, 283, 285. 287, 289, 291. 293, 295, 297, 299, 301, 303, 305, 307, 309, 31 1, 
313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 
347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 

25 381, 383, 385, 387, 389, 391 or 393, for example a fragment which can be used as a 
probe or primer or a fragment encoding a biologically active portion of the polypeptide 
of the present invention or of a polypeptide used in the process of the present 
invention, i.e. having above-mentioned activity, e.g.conferring an increase of 
methionine if its activity is increased. The nucleotide sequences determined from the 

30 cloning of the present protein-according-to-the-invention-encoding gene allows for the 
generation of probes and primers designed for use in identifying and/or cloning its 
homologues in other cell types and organisms. The probe/primer typically comprises 
substantially purified oligonucleotide. The oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 12, 

35 15 preferably about 20 or 25, more preferably about 40, 50 or 75 consecutive 

nucleotides of a sense strand of one of the sequences set forth, e.g., in SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35^ 37, 39, 41 43, 45, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79. 81, 83, 85, 87. 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129. 131, 133, 135, 

40 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169. 
171. 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203. 
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205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229. 231, 233. 235, 237, 
239, 241. 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267. 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 
307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 
5 341 , 343, 345, 347, 349, 351 , 353, 355, 357, 359, 361 , 363, 365, 367, 369, 371 . 373. 
375, 377, 379, 381, 383, 365, 367, 369, 391or 393, an anti-sense sequence cf one of 
the sequences, e.g., set forth in SEQ ID NO: 1. 3. 5, 7, 9, 11, 13. 15. 17, 19, 21, 23, 25, 
27, 29, 31, 33, 35, 37. 39, 41 43, 45. 55. 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 
81, 83, 85. 87, 89, 91. 93, 95, 97, 99, 101. 103. 105. 107, 109, 111, 113. 115, 117, 119, 
10 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163. 165. 167, 169, 171, 173, 175, 177, 179. 181, 183, 185, 187. 
189, 191, 193; 195, 197, 199, 201. 203, 205, 207, 209, 211, 213, 215, 217,219, 221. 
223, 225, 227, 229. 231, 233, 235, 237, 239, 241. 243, 245, 247, 249, 251, 253, 255. 
257, 259. 261. 263. 265, 267, 269, 271, 273, 275, 277, 279, 281. 283, 285. 287. 289, 
15 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 
325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347. 349, 351, 353, 355, 357, 
359, 361. 363, 365. 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391or 
393, or naturally occurring mutants thereof. Primers based on a nucleotide of invention 
can be used in PCR reactions to clone homologues of the polypeptide of the invention 
20 or of the polypeptide used in the process of the invention, e.g. as the primers described 
in the examples of the present invention, e.g. as shown in the examples. A PCR with 
the primers shown in SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 395 or SEQ ID NO: 
396 will at least result in a fragment of gene products as depicted in the sequences 
SEQ ID NO: 1, 3, 5, 7, 9. 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35. 37, 39, 41 
25 43, 45. 55. 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75, 77, 79, 81 , 83. 85, 87, 89, 91 , 93, 95. 
97, 99. 101. 103. 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127. 129. 131. 
133, 135, 137, 139. 141. 143, 145, 147. 149. 151. 153, 155, 157, 159, 161, 163, 165, 
167, 169, 171, 173, 175, 177, 179. 181. 183. 185. 187, 189, 191, 193, 195. 197, 199. 
201. 203, 205, 207, 209, 211, 213, 215, 217, 219, 221. 223. 225. 227, 229. 231, 233, 
30 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257. 259, 261, 263, 265, 267, 
269, 271. 273. 275. 277. 279, 281. 283, 285, 287, 289, 291, 293, 295, 297. 299, 301, 
303, 305. 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 
337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359. 361. 363, 365, 367, 369. 
371 , 373, 375, 377. 379. 381 , 383, 385, 387, 389, 391or 393. 

35 

[01 63.0.0.0] Primer sets are interchangable. The person skilled in the art knows to 
combine said primers to result in the desired product, e.g. in a full-length clone or a 
partial sequence. Probes based on the sequences of the nucleic acid molecule of the 
invention or used in the process of the present invention can be used to detect 
40 transcripts or genomic sequences encoding the same or homologous proteins. The 
probe can further comprise a label group attached thereto, e.g. the label group can be 
a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such 
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probes can be used as a part of a genomic marker test kit for identifying cells which 
express an polypepetide of the invention or used in the process of the present 
invention, such as by measuring a level of an encoding nucleic acid molecule in a 
sample of cells, e.g., detecting mRNA levels or determining, whether a genomic gene 
5 comprising the sequence of the polynucleotide of the invention or used in the processs 
of the present invention has been mutated or deleted. 

[0164.0.0.0] The nucleic acid molecule of the invention encodes a polypeptide or 
portion thereof which includes an amino acid sequence which is sufficiently 
homologous to the amino acid sequence as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 

10 1.4, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 
68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 
178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 

15 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 
246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 
348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 

20 382, 384, 386, 388, 390, 392 or 394 such that the protein or portion thereof maintains 
the ability to participate in the fine chemical production, in particular an amino acid 
increasing activity as mentioned above or as described in the examples in plants or 
microorganisms is comprised. 

[0165.0.0.0] As used herein, the language "sufficiently homologous" refers to 

25 proteins or portions thereof which have amino acid sequences which include a 

minimum number of identical or equivalent amino acid residues (e.g., an amino acid 
residue which has a similar side chain as an amino acid residue in one of the 
sequences of the polypeptide of the present invention) to an amino acid sequence as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 

30 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 

35 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 such that 

40 the protein or portion thereof is able to participate in the increase of the fine chemical 
production. For examples having the biological activity represented by a protein as 
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depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84. 86, 88, 90, 
92*. 94*. 96, 98, 100, 102. 104, 106, 108, 110. 112, 114, 116. 118. 120. 122. 124. 126. 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150. 152. 154. 156, 158. 160. 

5 162*. 164, 166, 168, 170, 172, 174, 176. 178, 180, 182, 184, 186, 188, 190. 192. 194. 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238. 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264', 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292. 294. 296. 
298', 300. 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324. 326, 328, 330, 

10 332*, 334, 336, 338, 340, 342. 344, 346, 348, 350, 352, 354. 356. 358, 360, 362. 364. 
366*. 368. 370. 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 are 
described herein. 

[01 66.0.0.0] In one embodiment, the nucleic acid molecule of the present invention 
comprises a nucleic acid that encodes a portion of the protein of the present invention. 

1 5 The protein is at least about 30%. 35%, 40%, 45% or 50%, preferably at least about 
55%, 60%, 65% or 70%, and more preferably at least about 75%, 80%, 85%, 90%. 
91 %', 92%, 93% or 94% and most preferably at least about 95%, 97%, 98%, 99% or 
more homologous to an entire amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 
14, 16, 18. 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 

20 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104. 106, 108, 
110. 112, 114, 116, 118, 120. 122. 124. 126. 128. 130. 132, 134, 136, 138, 140. 142. 
-144, 146, 148, 150, 152. 154. 156. 158. 160. 162. 164, 166, 168, 170, 172, 174, 176, 
178*. 180. 182, 184, 186, 188, 190, 192. 194. 196. 198. 200, 202. 204, 206, 208, 210, 
212*. 214. 216. 218. 220. 222. 224. 226. 228. 230. 232. 234. 236, 238, 240, 242, 244, 

25 246*. 248. 250. 252, 254, 256, 258. 260. 262, 264, 266, 268, 270, 272, 274, 276, 278, 
28o', 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
314*. 316. 318. 320. 322. 324. 326, 328. 330, 332, 334, 336, 338, 340, 342, 344, 346, 
348, 350, 352, 354, 356, 358, 360. 362. 364, 366, 368, 370, 372, 374, 376, 378, 380, 
382, 384, 386, 388, 390, 392 or 394 and having above-mentioned activity, 

30 e.g.conferring preferably the increase of the fine chemical. 

[01 67.0.0.0] Portions of proteins encoded by the nucleic acid molecule of the 
invention are preferably biologically active, preferably having above-mentioned 
annotated activity, e.g. conferring an increase of the fine chemical after increase of 
activity. 

35 [01 68.0.0.0] As mentioned herein, the term "biologically active portion" is intended to 
include a portion, e.g., a domain/motif, that confers increase of the fine chemical or has 
an immunological activity such that it is binds to an antibody binding specifially to the 
polypeptide of the present invention or a polypeptide used in the process of the present 
invention for producing the fine chemical; 
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[01 69.0.0.0] The invention further relates to nucleic acid molecules that differ from 
one of the nucleotide sequences as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 
17. 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63. 65, 67, 69, 
71. 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111. 
5 113, 115, 117, 119, 121. 123. 125. 127. 129, 131, 133, 135, 137, 139, 141, 143, 145, 
147, 149. 151, 153, 155, 157, 159, 161, 163. 1.65, 167, 169, 171, 173, 175, 177. 179, 
181, 183. 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239. 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277. 279, 281, 

10 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 31 1, 313, 315, 
317, 319, 321, 323 r 325, 327, 329, 331, 333, 335, 337, 339, 34.1, 343,345, 347, .349,. 
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 
385, 387, 389, 391 or 393 (and portions thereof) due to degeneracy of the genetic code 
and thus encode a polypeptide of the present invention, in particular a polypeptide 

1 5 having above mentioned activity, e.g. conferring an increase in the fine chemical in a 
organism. Advantageously, the nucleic acid molecule of the invention comprises, or in 
an other embodiment has, a nucleotide sequence encoding a protein comprising, or in 
an other embodiment having, an amino acid sequence as depicted in SEQ ID NO: 2, 4, 
6. 8, 10, 12, 14, 16, 18, 20. 22, 24. 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, $8, 60, 

20 62, 64, 66, 68, 70, 72, 74, 76, 78. 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112. 114, 116, 118, 120. 122. 124. 126, 128. 130, 132, 134, 136. 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168. 170. 172, 
174, 176, 178, 180, 182, 184, 186, 188. 190, 192, 194, 196, 198, 200, 202, 204, 206. 
208, 210, 212, 214. 216, 218. 220, 222, 224. 226, 228, 230, 232, 234. 236. 238, 240. 

25 242, 244, 246, 248, 250, 252, 254. 256, 258, 260, 262, 264, 266. 268, 270, 272, 274, 
276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308. 
310, 312, 314, 316, 318, 320, 322. 324, 326, 328, 330, 332, 334. 336, 338, 340, 342, 
344, 346. 348, 350, 352. 354. 356, 358, 360, 362, 364, 366, 368, 370. 372, 374, 376, 
378, 380, 382, 384, 386, 388, 390, 392 or 394 or the functional homologues. In a still 

30 further embodiment, the nucleic acid molecule of the invention encodes a full length 
protein which is substantially homologous to an amino acid sequence as depicted in 
SEQ ID NO: 2, 4, 6, 8, 10, 12. 14. 16. 18, 20, 22. 24, 26, 28, 30. 32. 34, 36, 38, 40, 42, 
44, 46, 56, 58, 60, 62, 64, 66. 68. 70. 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 
98, 100, 102, 104. 106. 108, 110, 112, 114, 116, 118. 120, 122, 124, 126, 128, 130, 

35 132, 134, 136, 138. 140, 142, 144. 146, 148, 150, 152. 154. 156. 158, 160. 162. 164. 
166, 168, 170, 172, 174, 176, 178, 180, 182, 184. 186, 188, 190, 192, 194, 196, 198, 
200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224. 226, 228, 230, 232, 
234, 236, 238, 240, 242. 244, 246, 248, 250, 252, 254, 256, 258, 260, 262. 264. 266. 
268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300. 

40 302, 304, 306, 308, 310, 312, 314, 316. 318, 320. 322, 324. 326, 328, 330, 332, 334. 
336. 338, 340, 342, 344, 346, 348, 350, 352, 354, 356. 358, 360, 362, 364, 366, 368, 
370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 or the functional 
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homologues. However, in a preferred embodiment, the nucleic acid molecule of the 
present invention does not consist of the sequence as depicted in SEQ ID NO: 1 , 3, 5, 
7 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 
63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93. 95, 97, 99, 101, 103, 105, 
5 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
a a a-? aao AtiA <w ii« «7 1KQ 1R1. 163. 165. 167. 169, 171, 173, 

I'M, IH-O, I tvJ, l**< , IT5| •«■»>> iwv>, i ■ — ■ , . , ■• 

175, 177. 179, 181, 183, 185, 187, 189, 191, 193. 195, 197, 199, 201, 203, 205, 207, 
209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 
243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 
10 277, 279. 281 , 283, 285, 287, 289, 291 , 293, 295, 297, 299, 301 , 303, 305. 307. 309. 

311. 313. 315. 317, 319. 321, 323, 325. 327. 329, 331, 333, 335, 337, 339, 341, 343, 

345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 
379, 381. 383, 385, 387, 389, 391 or 393. 

[01 70.0.0.0] In addition, it will be appreciated by those skilled in the art that DNA 
1 5 sequence polymorphisms that lead to changes in the amino acid sequences may exist 
within a population. Such genetic polymorphism in the gene encoding the polypeptide 
of the invention or comprising the nucleic acid molecule of the invention may exist 
among individuals within a population due to natural variation. 

[0171 .0.0.0] As used herein, the terms "gene" and "recombinant gene" refer to 
20 nucleic acid molecules comprising an open reading frame encoding the polypeptide of 
the invention or comprising the nucleic acid molecule of the invention or encoding the 
polypeptide used in the process of the present invention, preferably from a crop plant 
or from a microorgansim useful for the production of fine chemicals, in particular for the 
production of the fine chemical. Such natural variations can typically result in 1-5% 
25 . variance in the nucleotide sequence of the gene. Any and all such nucleotide variations 
and resulting amino acid polymorphisms in genes encoding a polypeptide of the 
invention or comprising a the nucleic acid molecule of the invention that are the result 
of natural variation and that do not alter the functional activity as described are 
intended to be within the scope of the invention. 

30 [0172.0.0.0] Nucleic acid molecules corresponding to natural variants homologues of 
a nucleic acid molecule of the invention, which can also be a cDNA can be isolated 
based on their homology to the nucleic acid molecules disclosed herein using the 
nucleic acid molecule of the invention, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

35 [01 73.0.0.0] Accordingly, in another embodiment, a nucleic acid molecule of the 
invention is at least 15, 20, 25 or 30 nucleotides in length. Preferably, it hybridizes 
under stringent conditions to a nucleic acid molecule comprising a nucleotide sequence 
of the nucleic acid molecule of the present invention or used in the process of the 
present invention, e.g. comprising the sequence as depicted in SEQ ID NO: 1, 3, 5, 7, 
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9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 
63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 
5 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 
209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 
243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 
277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 
311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 
10 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 
379, 381, 383, 385, 387, 389, 391 or 393. The nucleic acid molecule is preferably at 
least 20, 30, 50, 100, 250 or more nucleotides in length. 

[0174.0.0.0] The term "hybridizes under stringent conditions" is defined above. In one 
embodiment, the term "hybridizes under stringent conditions" is intended to describe 
15 conditions for hybridization and washing under which nucleotide sequences at least 
30 %, 40 %, 50 % or 65% identical to each other typically remain hybridized to each 
other. Preferably, the conditions are such that sequences at least about 70%, more 
preferably at least about 75% or 80%, and even more preferably at least about 85%, 
90% or 95% or more identical to each other typically remain hybridized to each other. 

20 [0175.0.0.0] Preferably, nucleic acid molecule of the invention that hybridizes under 
stringent conditions to a sequence as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 
17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 
71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 

25 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 
181, 183, 185, 187, 189, 191, 193, 195, 197, 199,201, 203, 205, 207, 209, 211, 213, 
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 
283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 

30 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 
385, 387, 389, 391 or 393 corresponds to a naturally-occurring nucleic acid molecule of 
the invention. As used herein, a "naturally-occurring" nucleic acid molecule refers to an 
RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., 

35 encodes a natural protein). Preferably, the nucleic acid molecule encodes a natural 
protein having above-mentioned activity, e.g. conferring the fine chemical increase 
after increasing the expression or activity thereof or the activity of a protein of the 
invention or used in the process of the invention. 



40 



[0176.0.0.0] In addition to naturally-occurring variants of thesequences of the 
polypeptide or nucleic acid molecule of the invention as well as of the polypeptide or 
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nucleic acid molecule used in the process of the invention that may exist in the 
population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into a nucleotide sequence of the nucleic acid molecule encoding the 
polypeptide of the invention or used in the process of the present invention, thereby 
5 leading to changes in the amino acid sequence of the encoded polypeptide, without 
altering the functional ability of the polypeptide, preferably not decreasing said activity. 

[01 77.0.0.0] For example, nucleotide substitutions leading to amino acid substitutions 
at "non-essential" amino acid residues can be made in a sequence of the nucleic acid 
molecule of the invention or used in the process of the invention, e.g. in SEQ ID NO: 1 , 

10 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83. 85, 87, 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107, 109. 111, 113, 115, 117. 119. 121. 123. 125, 127, 129, 131, 133, 135, 
137, 139. 141. 143, 145, 147, 149, 151. 153, 155, 157, 159, 161, 163, 165, 167, 169, 
171, 173. 175, 177. 179. 181. 183. 185. 187. 189. 191. 193. 195. 197. 199. 201. 203. 

15 205 207. 209, 21 1, 213, 215. 217. 219. 221. 223, 225, 227. 229. 231, 233, 235, 237. 
239 241, 243, 245, 247, 249, 251. 253. 255. 257. 259. 261. 263. 265, 267, 269, 271, 
273, 275. 277. 279. 281. 283. 285. 287. 289. 291. 293, 295, 297, 299, 301, 303, 305, 
307*. 309, 311. 313. 315. 317. 319. 321. 323. 325. 327. 329. 331. 333. 335. 337, 339, 
341, 343, 345, 347, 349. 351. 353, 355, 357. 359, 361, 363. 365. 367. 369. 371. 373. 

20 375, 377, 379, 381 , 383, 385, 387. 389. 391or 393. 

[0178.0.0.0] A"non-essential" amino acid residue is a residue that can be altered 
from the wild-type sequence of one without altering the activity of said polypeptide, 
whereas an "essential" amino acid residue is required for an activity as mentioned 
above, e.g. leading to an increase in the fine chemical in an organism after an increase 
25 of activity of the polypeptide. Other amino acid residues, however, (e.g., those that are 
not conserved or only semi-conserved in the domain having said activity) may not be 
essential for activity and thus are likely to be amenable to alteration without altering 
said activity. 

[0179.0.0.0] Further, a person skilled in the art knows that the codon usage between 
30 organisms can differ. Therefore, he may adapt the codon usage in the nucleic acid 
molecule of the present invention to the usage of the organism in which the 
polynuclestide or polypeptide is expressed. 

[01 80.0.0.0] Accordingly, the invention relates to nucleic acid molecules encoding a 
polypeptide having above-mentioned biological activity, e.g. conferring an increase in 
35 the the fine chemical in an organism or part thereof that contain changes in amino acid 
residues that are not essential for said activity. Such polypeptides differ in amino acid 
sequence from a sequence contained in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30. 32. 34. 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 
76*. 78', 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 
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116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 
150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 
184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 
218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 
5 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 
286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 
320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 
354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 
388, 390, 392 or 394 yet retain said activity described herein. The nucleic acid 

10 molecule can comprise a nucleotide sequence encoding a polypeptide, wherein the 
polypeptide comprises an amino acid sequence at least about 50% identical to an 
amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44. 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 
84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 

15 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 
224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 

20 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, ,31 8, 320, 322, 324, 
326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 
360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 
394and is capable of participation in the increase of production of the fine chemical 
after increasing its activity, e.g. its expression. Preferably, the protein encoded by the 

25 nucleic acid molecule is at least about 60% identical to the sequence in SEQ ID NO: 2, 
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 
138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 

30 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 
206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 
240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 
274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 
308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 

35 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 
376, 378, 380, 382, 384, 386, 388, 390, 392 or 394, more preferably at least about 
70% identical to one of the sequences in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 
76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 

40 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 
150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 
184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 



WO 2005/014828 PCT/EP2004/008136 



87 



218 220 222 224, 226, 228, 230, 232, 234, 236, 238, 240, 242. 244, 246, 248, 250, 
252*. 254', 256', 258, 260, 262, 264, 266, 268, 270, 272, 274. 276, 278, 280, 282, 284, 
286, 288. 290. 292. 294. 296. 298. 300, 302. 304, 306, 308. 310, 312, 314, 316, 318, 
320* 322. 324*. 326. 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 
5 354'. 356. 358. 360. 362, 364, 366. 368, 370. 372. 374, 376, 378, 380, 382, 384, 386, 
ooo oon qqo nr "kQA mnre Dreferablv at least about 80%, 90%, 95% homologous 

vJw*w?, WWU, WW*- Wl ■ 1 ~ • — — - - 1- - * 

to the sequence in SEQ ID NO: 2, 4, 6, 8, 10, 12. 14. 16. 18, 20. 22. 24. 26, 28, 30. 32. 
34 36, 38. 40. 42, 44. 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86. 
88, 90, 92, 94, 96, 98, 100, 102, 104, 106. 108. 110. 112. 114. 116. 118, 120. 122. 124. 
10 126 128, 130. 132. 134. 136. 138, 140, 142, 144. 146, 148, 150, 152, 154, 156, 158, 
160* 162, 164, 166, 168,-170, 172,. 174, 176, 178, 180, 182, 184. 186, 188, 190. 192, 
194*. 196, 198, 200, 202. 204, 206, 208, 210, 212, 214. 216, 218, 220. 222, 224, 226, 
228 230, 232, 234, 236. 238. 240, 242. 244, 246. 248. 250. 252, 254. 256, 258, 260. 
262* 264, 266, 268, 270, 272, 274, 276. 278, 280, 282, 284, 286, 288, 290. 292, 294, 
15 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322. 324. 326, 328, 
330 332, 334, 336, 338, 340, 342, 344, 346, 348, 350. 352. 354. 356, 358, 360, 362, 
364', 366*. 368, 370, 372, 374, 376, 378, 380. 382. 384, 386. 388, 390. 392 or 394. and 
most preferably at least about 96%, 97%. 98%, or 99% identical to the sequence 
in SEQ ID NO: 2. 4, 6, 8. 10, 12. 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40. 
20 42. 44. 46. 56. 58, 60, 62, 64, 66, 68, 70, 72, 74, 76. 78. 80. 82. 84, 86, 88, 90, 92, 94, 
96 98, 100, 102. 104, 106, 108, 110. 112. 114. 116, 118, 120, 122, 124. 126. 128. 130, 
132 134. 136, 138, 140. 142. 144. 146. 148, 150, 152. 154. 156. 158. 160. 162. 164. 
166 168. 170. 172. 174. 176, 178. 180. 182, 184. 186. 188. 190. 192, 194, 196, 198, 
20o' 202, 204. 206, 208. 210. 212. 214. 216, 218. 220. 222. 224, 226. 228. 230, 232. 
25 234*. 236. 238, 240. 242, 244, 246. 248. 250. 252. 254, 256, 258, 260, 262, 264, 266, 
268 270, 272, 274. 276, 278, 280, 282, 284, 286, 288, 290, 292, 294. 296, 298, 300, 
302* 304. 306. 308, 310. 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332. 334, 
336*. 338. 340. 342. 344. 346. 348. 350. 352. 354. 356. 358, 360. 362. 364, 366, 368. 
370*. 372, 374. 376, 378, 380, 382. 384. 386, 388, 390. 392 or 394. 

30 [01 81 .0.0.0] To determine the percentage homology (= identity) of two amino acid 
sequences or of two nucleic acid molecules, the sequences are written one underneath 
the other for an optimal comparison (for example gaps may be inserted into the 
sequence of a protein or of a nucleic acid in order to generate an optimal alignment 
with the other protein or the other nucleic acid). 

35 [01 82.0.0.0] The amino acid residues or nucleic acid molecules at the corresponding 
amino acid positions or nucleotide positions are then compared. If a position in one 
sequence is occupied by the same amino acid residue or the same nucleic acid 
molecule as the corresponding position in the other sequence, the molecules are 
homologous at this position (i.e. amino acid or nucleic acid "homology" as used in the 

40 present context corresponds to amino acid or nucleic acid "identity". The percentage 
homology between the two sequences is a function of the number of identical positions 
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shared by the sequences (i.e. % homology = number of identical positions/total number 
of positions x 100). The terms "homology" and "identity" are thus to be considered as 
synonyms. 

[01 83-0,0.0] For the determination of the percentage homology (identity) of two or 
5 more amino acids or of two or more nucleotide sequences several computer software 
programs have been developed. The homology of two or more sequences can be 
calculated with for example the software fasta, which presently has been used in the 
version fasta 3 (W. R. Pearson and D. J. Lipman (1988), Improved Tools for Biological 
Sequence Comparison.PNAS 85:2444- 2448; W. R. Pearson (1990) Rapid and 

10 Sensitive Sequence Comparison with FASTP and FASTA, Methods in Enzymology 
183:63 - 98; W. R. Pearson and D. J. Lipman (1988) Improved Tools for Biological 
Sequence Comparison.PNAS 85:2444- 2448; W. R. Pearson (1990); Rapid and 
Sensitive Sequence Comparison with FASTP and FASTAMethods in Enzymology 
1 83:63 - 98). Another useful program for the calculation of homologies of different 

15 sequences is the standard blast program, which is included in the Biomax pedant 
software (Biomax, Munich, Federal Republic of Germany). This leads unfortunately 
sometimes to suboptimal results since blast does not always include complete 
sequences of the subject and the querry. Nevertheless as this program is very efficient 
it can be used for the comparison of a huge number of sequences. The following 

20 settings are typically used for such a comparisons of sequences: 

-p Program Name [String]; -d Database [String]; default = nr; -i Query File [File In]; 
default = stdin; -e Expectation value (E) [Real]; default = 10.0; -m alignment view 
options: 0 = pairwise; 1 = query-anchored showing identities; 2 = query-anchored no 
identities; 3 = flat query-anchored, show identities; 4 = flat query-anchored, no 

25 identities; 5 = query-anchored no identities and blunt ends; 6 = flat query-anchored, no 
identities and blunt ends; 7 = XML Blast output; 8 = tabular; 9 tabular with comment 
lines [Integer]; default = 0; -o BLAST report Output File [File Out] Optional; default = 
stdout; -F Filter query sequence (DUST with blastn, SEG with others) [String]; default 
= T; -G Cost to open a gap (zero invokes default behavior) [Integer]; default = 0; -E 

30 Cost to extend a gap (zero invokes default behavior) [Integer]; default = 0; -X X dropoff 
value for gapped alignment (in bits) (zero invokes default behavior); blastn 30, 
megablast 20, tblastx 0, all others 15 [Integer]; default = 0; -I Show Gl's in def lines 
[T/F]; default = F; -q Penalty for a nucleotide mismatch (blastn only) [Integer]; default = 
-3; -r Reward for a nucleotide match (blastn only) [Integer]; default = 1; -v Number of 

35 database sequences to show one-line descriptions for (V) [Integer]; default = 500; -b 
Number of database sequence to show alignments for (B) [Integer]; default = 250; -f 
Threshold for extending hits, default if zero; blastp 11, blastn 0, blastx 12, tblastn 13; 
tblastx 13, megablast 0 [Integer]; default = 0; -g Perfom gapped alignment (not 
available with tblastx) |T/F]; default = T; -Q Query Genetic code to use [Integer]; 

40 default =1;-D DB Genetic code (for tblast[nx] only) [Integer]; default = 1; -a Number 
of processors to use [Integer]; default = 1; -O SeqAlign file [File Out] Optional; -J 
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Believe the query defline [T/F]; default = F; -M Matrix [String]; default = BLOSUM62; - 
W Word size, default if zero (blastn 1 1 , megablast 28, all others 3) [Integer]; default - 
0; -z Effective length of the database (use zero for the real size) [Real]; default = 0; -K 
Number of best hits from a region to keep (off by default, if used a value of 100 is 

5 recommended) [Integer]; default = 0; -P 0 for multiple hit, 1 for single hit [Integer]; 
default = 0; -Y Effective length of the search space (use zero for the real size) [Real]; 
default = 0; -S Query strands to search against database (for blast[nx], and tblastx); 3 
is both, 1 is top, 2 is bottom [Integer]; default = 3; -T Produce HTML output [T/F]; 
default = F; -I Restrict search of database to list of Gl's [String] Optional; -U Use 

10 lower case filtering of FASTA sequence [T/F] Optional; default = F; -y X dropoff value 
for ungapped extensions in bits (0.0 invokes default behavior); blastn 20, megablast 
10, all others 7 [Real]; default = 0.0; -Z X dropoff value for final gapped alignment in 
bits (0.0 invokes default behavior); blastn/megablast 50, tblastx 0, all others 25 
[Integer]; default = 0; -R PSI-TBLASTN checkpoint file [File In] Optional; -n 

15 MegaBlast search [T/F]; default = F; -L Location on query sequence [String] Optional; 
-A Multiple Hits window size, default if zero (blastn/megablast 0, all others 40 [Integer]; 
default = 0; -w Frame shift penalty (OOF algorithm for blastx) [Integer]; default = 0; -t 
Length of the largest intron allowed in tblastn for linking HSPs (0 disables linking) 
[Integer]; default = 0. 

20 [01 84.0.0.0] Results of high quality are reached by using the algorithm of Needleman 
and Wunsch or Smith and Waterman. Therefore programs based on said algorithms 
are preferred. Advantageously the comparisons of sequences can be done with the 
program PileUp (J. Mol. Evolution., 25. 351-360, 1987, Higgins et aL. CABIOS, 5 1989: 
151-153) or preferably with the programs Gap and BestFit, which are respectively 

25 based on the algorithms of Needleman and Wunsch [J. Mol. Biol. 48; 443^53 (1970)] 
and Smith and Waterman [Adv. Appl. Math. 2; 482-489 (1981)]. Both programs are part 
of the GCG software-package [Genetics Computer Group, 575 Science Drive, 
Madison, Wisconsin, USA 53711 (1991); Altschul et al. (1997) Nucleic Acids Res. 
25:3389 et seq.]. Therefore preferably the calculations to determine the perentages of 

30 sequence homology are done with the program Gap over the whole range of the 
sequences. The following standard adjustments for the comparison of nucleic acid 
sequences were used: gap weight: 50, length weight: 3, average match: 10.000, 
average mismatch: 0.000. 

[0185.0.0.0] For example a sequence which has a 80% homology with sequence 
35 SEQ ID NO: 1 at the nucleic acid level is understood as meaning a sequence which, 
upon comparison with the sequence SEQ ID NO: 1 by the above Gap program 
algorithm with the above parameter set, has a 80% homology. 

[0186.0.0.0] In the state of the art, homology between two polypeptides is also 
understood as meaning the identity of the amino acid sequence over in each case the 
40 entire sequence length which is calculated by comparison with the aid of the program 



WO 2005/014828 PCT/EP2004/008136 

90 

algorithm GAP (Wisconsin Package Version 10.0, University of Wisconsin, Genetics 
Computer Group (GCG), Madison, USA), setting the following parameters: 

Gap weight 8 Length weight: 2 

Average match: 2,912 Average mismatch: -2,003 

5 [0187.0.0.0] For example a sequence which has a 80% homology with sequence 
SEQ ID NO: 2 at the protein level is understood as meaning a sequence which, upon 
comparison with the sequence SEQ ID NO: 2 by the above program algorithm with the 
above parameter set has a 80% homology. 

[0188.0.0.0] Functional equivalents derived from one of the polypeptides as depicted 

10 in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30. 32, 34, 36, 38. 40, 
42, 44. 46. 56, 58, 60, 62, 64, 66, 68, 70. 72, 74, 76, 78, 80, 82. 84. 86, 88, 90, 92, 94. 
96, 98, 100, 102, 104, 106. 108, 110, 112, 114, 116, 118, 120, 122. 124, 126, 128, 130. 
132, 134, 136, 138, 140, 142, 144, 146. 148. 150, 152, 154, 156, 158, 160, 162, 164, 
166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 

15 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 
234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 
268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292/ 294, 296, 298, 300, 
302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 
336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 

20 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 according to the 
invention by substitution, insertion or deletion have at least 30%, 35%, 40%, 45% or 
50%, preferably at least 55%, 60%, 65% or 70% by preference at least 80%, especially 
preferably at least 85% or 90%, 91%, 92%, 93% or 94%, very especially preferably at 
least 95%, 97%, 98% or 99% homology with one of the polypeptides as shown in SEQ 

25 ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22. 24. 26. 28. 30, 32, 34, 36, 38, 40, 42, 44, 
46. 56, 58, 60, 62, 64. 66, 68, 70, 72, 74, 76, 78, 80, 82. 84, 86, 88, 90, 92, 94, 96. 98. 
100, 102, 104, 106. 108, 110, 112, 114,116, 118, 120, 122, 124, 126, 128, 130. 132, 
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 
168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200. 

30 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 
236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 
270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290. 292. 294. 296. 298, 300, 302, 
304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 
338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 

35 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 according to the 

invention and are distinguished by essentially the same properties as the polypeptide 
as shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 44, 46, 56, 58, 60, 62. 64, 66, 68, 70, 72, 74. 76, 78, 80, 82, 84, 86, 88, 
90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 

40 126, 128, 130, 132, 134, 136. 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 
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160 162 164, 166, 168, 170, 172, 174. 176, 178, 180, 182. 184, 186, 188, 190, 192, 
194* 196. 198, 200, 202, 204, 206, 208. 210. 212, 214, 216, 218, 220, 222, 224, 226, 
228 230. 232, 234. 236, 238, 240, 242, 244. 246. 248. 250. 252. 254. 256. 258. 260. 
262 264. 266. 268. 270. 272, 274, 276, 278, 280, 282, 284, 286, 288, 290. 292. 294. 
5 296 298. 300. 302, 304. 306, 308, 310. 312. 314. 316. 318. 320. 322, 324, 326. 328. 
330 332. 334. 336. 338. 340, 342. 344. 346. 348, 350. 352. 354, 356, 358, 360. 362. 
364. 366, 368, 370. 372. 374. 376. 378. 380. 382, 384. 386. 388. 390. 392 or 394. 

[01 89 0 0 0] Functional equivalents derived from the nucleic acid sequence as 
depicted in SEQ ID NO: 1. 3, 5. 7, 9, 11. 13. 15, 17, 19, 21, 23, 25, 27, 29. 31. 33. 35, 
10 37. 39. 41 43. 45. 55. 57. 59. 61. 63. 65. 67. 69, 71, 73. 75. 77. 79. 81. 83. 85. 87. 89. 
91 93, 95. 97, 99, 101, 103. 105. 107. 109. 111, 113. 115. 117, 119, 121, 123. 125. 
127 129 131, 133, 135. 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157. 159, 
161* 163'. 165, 167, 169. 171. 173. 175. 177. 179, 181, 183, 185, 187, 189, 191. 193. 
195 197. 199. 201. 203. 205. 207, 209. 211. 213. 215, 217. 219. 221. 223, 225. 227. 
15 229 231. 233, 235, 237, 239, 241, 243, 245, 247, 249, 251. 253. 255, 257, 259. 261, 
263 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287. 289, 291, 293. 295, 
297 299, 301, 303, 305. 307. 309. 311, 313. 315, 317, 319, 321. 323, 325, 327. 329, 
33l' 333. 335, 337, 339, 341, 343, 345. 347, 349, 351, 353, 355, 357, 359. 361. 363. 
365'. 367. 369. 371. 373. 375. 377. 379. 381. 383, 385, 387, 389, 391or 393according 
20 to the invention by substitution, insertion or deletion have at least 30%, 35%. 40%, 45 h 
or 50% preferably at least 55%, 60%, 65% or 70% by preference at least 80%. 
especially preferably at least 85% or 90%. 91%, 92%, 93% or 94%, very especially 
preferably at least 95%. 97%. 98% or 99% homology with one of the polypeptides as 
shown in SEQ ID NO: 2. 4, 6. 8, 10. 12. 14. 16. 18. 20. 22. 24. 26, 28, 30. 32. 34 36 
25 38. 40. 42. 44. 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80. 82. 84. 86. 88. 90. 
92 94 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120. 122, 124, 126, 
128 130 132. 134. 136. 138. 140. 142, 144. 146. 148. 150. 152. 154, 156, 158. 160. 
162 164' 166. 168. 170. 172. 174, 176, 178, 180, 182, 184, 186. 188. 190, 192, 194, 
196 198. 200. 202. 204. 206. 208, 210, 212. 214, 216, 218, 220, 222, 224. 226. 228. 
30 230' 232. 234. 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264' 266, 268. 270. 272. 274, 276. 278. 280. 282. 284. 286. 288. 290. 292, 294, 296, 
298 300 302, 304, 306, 308. 310. 312. 314. 316. 318, 320. 322. 324, 326, 328, 330, 
332' 334 336 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366 36 8 ', 370. 372, 374, 376, 378, 380, 382, 384, 386. 388, 390, 392 or 394 according 
35 to the invention and encode polypeptides having essentially the same propert.es as the 
polypeptide as shown in SEQ ID NO: 2. 4. 6. 8, 10, 12. 14, 16, 18, 20, 22. 24. 26, 28 
30 32 34. 36. 38. 40. 42, 44. 46. 56. 58. 60, 62, 64, 66. 68. 70. 72. 74. 76. 78. 80. 82. 
84 86 88. 90, 92, 94, 96, 98, 100, 102, 104. 106, 108. 110. 112. 114, 116, 118, 120, 
122 124 126, 128, 130, 132. 134, 136, 138, 140, 142, 144, 146, 148. 150. 152. 154, 
40 156'. 15 8 ', 160, 162, 164, 166. 168. 170, 172, 174, 176, 178. 180. 182. 184. 186. 188. 
190 192 194. 196. 198, 200, 202. 204, 206. 208. 210. 212. 214. 216, 218. 220, 222, 
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224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
258, 260. 262, 264, 266, 268. 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 
292, 294. 296, 298. 300. 302. 304. 306, 308, 310, 312, 314, 316, 318. 320. 322, 324, 
326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 
5 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 
394. 

[0190.0.0.0] "Essentially the same properties" of a functional equivalent is above all 
understood as meaning that the functional equivalent has above mentioned acitivty, e.g 
conferring an increase in the fine chemical amount while increasing the amount of 
10 protein, activity or function of said functional equivalent in an organism, e.g. a 

microorgansim, a plant or plant or animal tissue, plant or animal cells or a part of the 
same. 

[0191.0.0.0] A nucleic acid molecule encoding an homologous to a protein sequence 
of SEQ ID NO: 2, 4, 6, 8. 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32. 34, 36, 38, 40, 
15 42. 44. 46. 56, 58, 60, 62, 64, 66. 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 

96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130. 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 15?., 154, 156, 158, 160, 162, 164, 

166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 

200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 
20 234, 236, 238, 240, 242, 244, 246. 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 

268, 270, 272, 274, 276, 278. 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 
302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 

336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 

370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 can be created by 
25 introducing one or more nucleotide substitutions, additions or deletions into a 

nucleotide sequence of the nucleic acid molecule of the present invention, in particular 
of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31. 33, 35, 37, 39, 41 
43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71. 73, 75. 77, 79, 81; 83, 85, 87. 89, 91, 93, 95. 

97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131. 
30 133. 135, 137, 139, 141, 143. 145, 147, 149, 151, 153. 155, 157, 159, 161, 163, 165, 

167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 

201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231. 233, 
235, 237. 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 26.1, 263, 265, 267, 

269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 
35 303. 305, 307. 309. 311, 313. 315. 317, 319, 321, 323, 325, 327, 329. 331, 333, 335, 

337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 

371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391 or 393such that one or more 
amino acid substitutions, additions or deletions are introduced into the encoded protein. 
Mutations can be introduced into the encoding sequences of SEQ ID NO: 1 , 3, 5, 7, 9, 

40 11, 13, 15, 17. 19. 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 
65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
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109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 
211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237. 239, 241, 243, 
5 245, 247, 249, 251 , 253, 255, 257, 259, 261 , 263, 265, 267, 269, 271 . 273, 275. 277, 

n-rn 001 OOO OOC 007 OOQ OCM OQ'i OQ*. 9Q7 9QQ 301 3(13. 305. 307. 309. 31 1 . 
^/ !?, £.0 I , £Uvl, £.<->v», £-\Jl , C\J*J, • 1 — ■ , ■ — > — •- - . 

313. 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335. 337. 339. 341. 343. 345. 
347. 349. 351. 353. 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377. 379, 
381, 383, 385, 387, 389, 391 or 393 by standard techniques, such as site-directed 
10 mutagenesis and PCR-mediated mutagenesis. 

[0192.0.0.0] Preferably, conservative amino acid substitutions are made at one or 
more predicted non-essential amino acid residues. A "conservative amino acid 
substitution" is one in which the amino acid residue is replaced with an amino acid 
residue having a similar side chain. Families of amino acid residues having similar side 

15 chains have been defined in the art. These families include amino acids with basic side 
chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic 
acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, 
threonine, tyrosine, cysteine), nonpolar side chains (e.g.. alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 

20 (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine). 

[0193.0.0.0] Thus, a predicted nonessential amino acid residue in a polypeptide of 
the invention or a polypeptide used in the process of the invention is preferably 
replaced with another amino acid residue from the same family. Alternatively, in 

25 another embodiment, mutations can be introduced randomly along all or part of a 

coding sequence of a nucleic acid molecule of the invention or used in the process of 
the invention, such as by saturation mutagenesis, and the resultant mutants can be 
screened for activity described herein to identify mutants that retain or even have 
increased above mentioned activity, e.g. conferring an increase in content of the fine 

30 chemical. 

[0194.0.0.0] Following mutagenesis of one of the sequences of shown herein, the 
encoded protein can be expressed recombinantly and the activity of the protein can be 
determined using, for example, assays described herein (see Examples). 

[0195.0.0.0] The highest homology of the nucleic acid molecule used in the process 
35 according to the invention was found for the following database entries by Gap search. 

[0196.0.0.0] Homologues of the nucleic acid sequences used, with the sequence as 
depicted in SEQ ID NO: 1. 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27. 29. 31. 33. 35. 
37. 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85. 87, 89, 
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91,93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 
161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 
195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 
5 229, 231 , 233, 235, 237, 239, 241 , 243, 245, 247, 249, 251 , 253, 255, 257, 259, 261 , 
263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 
297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 
331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 
365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391or393, comprise 

10 also allelic variants with at least approximately 30%, 35%, 40% or 45% homology, by 
preference at least approximately 50%, 60% or 70%. more preferably at least 
approximately 90%, 91 %, 92%, 93%, 94% or 95% and even more preferably at least 
approximately 96%, 97%, 98%, 99% or more homology with one of the nucleotide 
sequences shown or the abovementioned derived nucleic add sequences or their 

15 homologues, derivatives or analogues or parts of these. Allelic variants encompass in 
particular functional variants which can be obtained by deletion, insertion or substitution 
of nucleotides from the sequences shown, preferably from SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 
13, 15, 17, 19, 21, 23, 25. 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81. 83. 85. 87, 8~9, 91. 93, 95, 97, 99, 101. 103, 105, 107, 

20 109, 111, 113, 115, 117, 119. 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145, 147, 149, 151, 153. 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209. 
211, 213, 215, 217, 219, 221, 223, 225. 227, 229, 231, 233, 235, 237, 239, 241, 243, 
245, 247, 249, 251, 253, 255. 257, 259, 261. 263. 265. 267. 269, 271, 273, 275, 277, 

25 279. 281 , 283, 285, 287, 289, 291 , 293, 295, 297, 299, 301 , 303, 305, 307, 309, 31 1 , 
313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345. 
347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 
381, 383, 385, 387, 389, 391 or 393, or from the derived nucleic acid sequences, the 
intention being, however, that the enzyme activity or the biological activity of the 

30 resulting proteins synthesized is advantageously retained or increased. 

[0197.0.0.0] In one embodiment of the present invention, the nucleic acid molecule of 
the invention or used in the process of the invention comprises the sequences shown 
in any of the sequences SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 
31, 33, 35, 37, 39, 41 43, 45, 55. 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79. 81, 83, 

35 85, 87,89,91, 93, 95, 97, 99, 101, 103, 105. 107, 109, 111, 113, 115, 117, 119, 121, 
123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 
191, 193, 195, 197, 199, 201, 203, 205, 207. 209, 211, 213, 215, 217, 219, 221, 223, 
225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 

40 259, 261 , 263, 265, 267, 269, 271 , 273, 275, 277, 279, 281 , 283, 285, 287, 289, 291 , 
293, 295, 297, 299, 301, 303, 305. 307. 309, 311, 313, 315, 317, 319, 321, 323, 325, 
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327, 329. 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 
361. 363, 365, 367, 369, 371, 373, 375. 377, 379, 381, 383, 385, 387, 389, 391or 393. 
It is preferred that the nucleic acid molecule comprises as little as possible other 
nucleotides not shown in any one of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
5 23. 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75. 
77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107. 109. 111, 113, 115, 
117, 119, 12l] 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 
151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171. 173, 175. 177, 179, 181, 183, 
185, 187. 189. 191. 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 
10 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251. 
253. 255. 257. 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 
287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 
321 , 323. 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 
355, 357, 359, 361. 363. 365, 367. 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 
15 389, 391or 393. In one embodiment, the nucleic acid molecule comprises less than 
500, 400, 300, 200, 100, 90, 80, 70, 60, 50 or 40 further nucleotides. In a further 
embodiment, the nucleic acid molecule comprises less than 30, 20 or 10 further 
nucleotides. In one embodiment, the nucleic acid molecule use in the process of the 
invention is identical to the sequences shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 
20 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 
71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135. 137, 139. 141. 143. 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169. 171, 173. 175. 177. 179. 
18l'. 183, 185. 187. 189. 191, 193, 195, 197, 199. 201, 203. 205, 207, 209, 211, 213, 
25 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271. 273, 275, 277, 279, 281. 
283, 285, 287, 289, 291, 293, 295, 297. 299, 301, 303, 305, 307, 309, 311, 313, 315. 
317, 319. 321. 323. 325, 327, 329, 331, 333, 335, 337. 339, 341, 343, 345, 347, 349, 
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 
30 385, 387, 389, 391 or 393. 

[0198.0.0.0] Also preferred is that the nucleic acid molecule used in the process of 
the invention encodes a polypeptide comprising the sequence as depicted in SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34. 36. 38. 40. 42. 44. 46. 
56. 58. 60. 62. 64. 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90. 92. 94. 96. 98. 100. 

35 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132. 134, 
136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 
170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198. 200, 202, 
204*. 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 
238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 

40 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 
306', 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 
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340. 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 
374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. In one embodiment, the 
nucleic acid molecule encodes less than 150, 130. 100, 80, 60, 50, 40 or 30 further 
amino acids. In a further embodiment, the encoded polypeptide comprises less than 
5 20, 15, 10, 9, 8, 7, 6 or 5 further amino acids. In one embodiment used in the inventive 
process, the encoded polypeptide is identical to the sequences as depicted in SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
56, 58. 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 
102, 104. 106. 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 
10 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 
1 70. 1 72, 1 74, 1 76, 1 78, 1 80, 1 82, 1 84, 1 86, 1 88, 1 90, 1 92, 1 94,. 1 96, 1 98, 200, 202, 

204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 
238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264. 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 

15 306, 308, 310. 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 
340, 342, 344, 346, 348, 350. 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372. 
374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. 

[0199.0.0.0] In one embodiment, the nucleic acid molecule of the invention or used in 
the process encodes a polypeptide comprising the sequence as depicted in SEQ ID 
20 NO: 2, 4, 6, 8, 10, 12, 14. 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44. 46, 
56, 58. 60. 62. 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 

102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 

136, 138, 140, 142, 144, 146, 148, 150. 152, 154, 156, 158, 160, 162, 164, 166, 168. 

170. 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 
25 204, 206, 208, 210. 212, 214, 216. 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 

238, 240, 242, 244. 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 

272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 
306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 
340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 

30 374, 376, 378, 380. 382. 384, 386, 388, 390, 392 or 394 and comprises less than 100 
further nucleotides. In a further embodiment, said nucleic acid molecule comprises less 
than 30 further nucleotides. In one embodiment, the nucleic acid molecule used in the 
process is identical to a coding sequence of the sequences as depicted in SEQ ID NO: 
1, 3, 5. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31. 33, 35, 37. 39. 41 43. 45, 55, 57, 

35 59, 61, 63, 65, 67. 69. 71. 73. 75. 77. 79. 81, 83, 85, 87. 89, 91, 93, 95, 97, 99, 101, 

103, 105, 107, 109, 111, 113, 115. 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 

137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 

171, 173, 175, 177. 179, 181, 183. 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 

205, 207. 209, 211. 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 
40 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 

273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297. 299, 301, 303, 305, 
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307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329. 331, 333. 335, 337, 339, 
341, 343. 345. 347, 349, 351, 353. 355, 357, 359, 361, 363, 365, 367, 369. 371, 373. 
375. 377. 379, 381, 383, 385, 387. 389. 391or393. 

[0200.0.0.0] Polypeptides (= proteins), which still have the essential enzymatic 

5 activity of the polypeptide of the present invention conferring an increase of the fine 
chemical i.e. whose activity is essentially not reduced, are polypeptides with at least 
10% or 20%, by preference 30% or 40%. especially preferably 50% or 60%. very 
especially preferably 80% or 90 or more of the wild type biological activity or enzyme 
activity, advantageously, the activity is essentially not reduced in comparison with the 

10 activity of a polypeptide as depicted in SEQ ID NO: 2, 4, 6, 8, 10. 12. 14. 16. 18. 20, 
22 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66. 68. 70, 72. 74. 
76*. 78, 8o', 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 
116 118, 120, 122. 124, 126, 128. 130. 132. 134. 136. 138. 140, 142, 144, 146, 148, 
150. 152, 154, 156, 158, 160. 162. 164, 166, 168, 170, 172. 174, 176, 178, 180. 182, 

15 184', 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 
218 220 222, 224, 226, 228, 230. 232. 234, 236, 238, 240, 242, 244, 246, 248, 250. 
252*, 254'. 256, 258. 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 
286 288. 290. 292. 294. 296. 298. 300. 302. 304. 306. 308. 310. 312. 314. 316. 318, 
32o' 322 324, 326, 328, 330, 332. 334. 336. 338. 340. 342. 344. 346. 348. 350. 352. 

20 354*. 356. 358, 360, 362, 364, 366, 368, 370, 372. 374. 376, 378, 380, 382, 384, 386, 
388, 390, 392 or 394 expressed under identical conditions. 

[0201.0.0.0] Homologues of as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19 21. 23. 25, 27, 29, 31, 33, 35, 37, 39, 41 43. 45, 55. 57. 59, 61, 63, 65, 67, 69, 71, 
73 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107. 109. 111, 113, 
25 115 117, 119, 121. 123. 125. 127. 129, 131. 133. 135. 137. 139. 141. 143. 145. 147. 
149 151 153. 155. 157. 159. 161. 163. 165. 167. 169, 171, 173, 175, 177, 179, 181, 
183 185, 187, 189, 191. 193, 195, 197, 199, 201, 203, 205, 207, 209, 211. 213, 215, 
217 219 221, 223, 225, 227, 229. 231. 233. 235. 237, 239, 241, 243, 245. 247. 249, 
251 \ 253, 255, 257. 259. 261. 263. 265. 267. 269. 271. 273. 275. 277, 279, 281, 283, 
30 285 287. 289. 291. 293, 295, 297. 299, 301, 303, 305. 307. 309, 31 1. 313. 315. 317. 
319 321 323, 325. 327. 329. 331. 333. 335. 337, 339, 341, 343, 345, 347. 349, 351, 
353', 355, 357, 359, 361 , 363, 365, 367, 369, 371 , 373, 375, 377, 379, 381 , 383. 385. 
387, 389, 391 or 393 also mean truncated sequences, cDNA, single-stranded DNAor 
RNA of the coding and noncoding DNA sequence. Homologues of said sequences are 
35 also understood as meaning derivatives, which comprise noncoding regions such as. 
for example, UTRs, terminators, enhancers or promoter variants. The promoters 
upstream of the nucleotide sequences stated can be modified by one or more 
nucleotide substitutions), insertions) and/or deletion(s) without, however, interfering 
with the functionality or activity either of the promoters, the open reading frame 
40 (= ORF) or with the ^-regulatory region such as terminators or other 3'regulatory 

regions, which are far away from the ORF. It is furthermore possible that the activity of 
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the promoters is increased by modification of their sequence, or that they are replaced 
completely by more active promoters, even promoters from heterologous organisms. 
Appropriate promoters are known to the person skilled in the art and are mentioned 
herein below. 

5 [0202.0.0.0] In a further embodiment, the process according to the present invention 
comprises the following steps: 

selecting an organism or a part thereof expressing the polypeptide of this 
invention; 

mutagenizing the selected organism or the part thereof; 

comparing the activity or the expression level of said polypeptide in the 
mutagenized organism or the part thereof with the activity or the expression of 
said polypeptide in the selected organisms or the part thereof; 

selecting the mutagenized organisms or parts thereof, which comprise an 
increased activity or expression level of said polypeptide compared to the 
selected organism (a) or the part thereof; 

optionally, growing and cultivating the organisms or the parts thereof; and 

recovering, and optionally isolating, the free or bound the fine chemical produced 
by the selected mutated organisms or parts thereof. 

[0203.0.0.0] The organisms or part thereof produce according to the herein 
20 mentioned process of the invention an increased level of free and/or protein-bound fine 
chemical compared to said control or selected organisms or parts thereof. 

[0204.0.0.0] Advantageously the seclected organisms are mutagenized according to 
the invention. According to the invention mutagenesis is any change of the genetic 
information in the genom of an organism, that means any structural or compositional 

25 change in the nucleic acid preferably DNA of an organism that is not caused by normal 
segregation or genetic recombiantion processes. Such mutations may occur 
spontaneously, or may be induced by mutagens as described below. Such change can 
be induced either randomly or selectivly. In both cases the genetic information of the 
organism is modified. In general this lead to the situation that the activity of the gene 

30 product of the relevant genes inside the cells or inside the organism is increased. 

[0205.0.0.0] In case of the specific or so called site directed mutagenesis a distinct 
gene is mutated and thereby its activity and/or the activity or the encoded gene product 
is repressed, reduced or increased, preferably increased. In the event of a random 



(a) 
(b) 

10 (c) 
(d) 

15 
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mutagenesis one or more genes are mutated by chance and their activities and/or the 
activities of their gene productes are repressed, reduced or increased, preferably 
increased. 

[0206.0.0.0] For the purpose of a mutagenesis of a huge population of organisms, 
5 such population can be transformed with a DNA construct, which is useful for the 
activation of as much as possible genes of an organism, preferably all genes. For 
example the construct can contain a strong promoter or one or more enhancers, which 
are capable of transcriptionally activate genes in the vicinity of their integration side. 
With this method it is possible to statistically mutagenize, eg activate nearly all genes of 
10 an organism by the random integration of an activation construct. Afterwards the skilled 
worker can identify those mutagenized lines in which a gene of the invention has been 
activated, which in turns leads to the desired increase in the fine chemical production. 

[0207.0.0.0] The genes of the invention can also be activated by mutagensis, either 
of regulatory or coding regions. In the event of a random mutagenesis a huge number 

15 of organisms are treated with a mutagenic agent. The amount of said agent and the 
intensity of the treatment will be chosen in such a manner that statistically nearly every 
gene is mutated once. The process for the random mutagensis as well as the 
respective agens is well known by the skilled person. Such methods are disclosed for 
example by AM. van Harten [(1998), "Mutation breeding: theory and practical , 

20 applications", Cambridge University Press, Cambridge, UK], E Friedberg, G Walker, W 
Siede [(1995), „DNA Repair and Mutagenesis", Blackwell Publishing], or K. 
Sankaranarayanan. J. M. Gentile, L. R. Ferguson [(2000) .Protocols in Mutagenesis", 
Elsevier Health Sciences]. As the skilled worker knows the spontaneous mutation rate 
in the cells of an organism is very low and that a large numer of chemical, physical or 

25 biological agents are available for the mutagenesis of organisms. These agents are 

named as mutagens or mutagenic agents. As mentioned before three different kinds of 
mutagens (chemical, physical or biological agents) are available. 

[0208.0.0.0] There are different classes of chemical mutagens, which can be 
separated by their mode of action. For example base analogues such as 5- 

30 bromouracil. 2-amino purin. Other chemical mutagens are interacting with the DNA 
such as sulphuric acid, nitrous acid, hydroxyzine; or other alkylating agents such as 
monofunctional agents like ethyl methanesulfonate, dimethylsulfate. methyl 
methanesulfonate), Afunctional like dichloroethyl sulphide, Mitomycin, 
Nitrosoguanidine - dialkylnitrosamine, N-Nitrosoguanidin derivatives, N-alkyl-N-nrtro-N- 

35 nitroso-guanidine-), ntercalating dyes like Acridine, ethidium bromide). 

[0209.0.0.0] Physical mutagens are for example ionizing irradiation (X ray), UV 
irradiation. Different forms of irradiation are available and they are strong mutagens. 
Two main classes of irradiation can be distinguished: a) non-ionizing irradiation such as 
UV light or ionizing irradiation such as X ray. Biological mutagens are for example 
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transposable elements for example IS elements such as IS100, transposons such as 
Tn5, Tn10, Tn916 or Tn1000 or phages like Mu amp,ac , P1 . T5, Aplac etc. Methods for 
introducing this phage DNA into the appropriate microorganism are well known to the 
skilled worker (see Microbiology, Third Edition, Eds. Davis, B.D., Dulbecco, R., Eisen, 
5 H.N. and Ginsberg, H.S., Harper International Edition, 1980). The common procedure 
of a transposon mutagenesis is the insertion of a transposable element within a gene 
or nearby for example in the promotor or terminator region and thereby leading to a 
loss of the gene function. Procedures to localize the transposon within the genome of 
the organisms are well known by a person skilled in the art. 

10 [0210.0.0.0] Preferably a chemical or biochemical procedure is used for the 

mutagenesis of the organisms. A preferred chemical method is the mutagensis with N- 
methyl-N-nitro-nitrosoguanidine. 

[0211.0.0.0] Other biological methods are disclosed by Spee et al. (Nucleic Acids 
Research, Vol. 21, No. 3, 1993: 777 - 778). Spee et al. teaches a PCR method using 

15 dITP for the random mutagenesis. This method described by Spee et al. was further 
improved by Rellos et al. (Protein Expr. Purif., 5, 1994 : 270 - 277). The use of an in 
vitro repombination technique for molecular mutagenesis is described by Stemmer 
(Proc. Natl. Acad. Sci. USA, Vol. 91, 1994: 10747 - 10751). Moore et al. (Nature 
Biotechnology Vol. 14, 1996: 458 - 467) describe the combination of the PCR and 

20 recombination methods for increasing the enzymatic activity of an esterase toward a 
para-nitrobenzyl ester. Another route to the mutagenesis of enzymes is described by 
Greener et al. in Methods in Molecular Biology (Vol. 57, 1996: 375 - 385). Greener et 
al. use the specific Escherichia coli strain XL1-Red to generate Escherichia coli 
mutants, which have increased antibiotic resistance. 

25 [0212.0.0.0] In one embodiment, the protein according to the invention or the nucleic 
acid molecule characterized herein originates from a eukaryotic or prokaryotic 
organism such as a non-human animal, a plant, a microorganism such as a fungus, 
yeast, an alga, a diatom or a bacterium. Nucleic acid molecules, which advantageously 
can be used in the process of the invention originate from yeasts, for example the 

30 family Saccharomycetaceae, in particular the genus Saccharomyces, or yeast genera 
such as Candida, Hansenula, Pichia, Yarrowia, Rhodotorula or Schizosaccharomyces 
and the especially advantageous from the species Saccharomyces cerevisiae. 

[0213.0.0.0] In one embodiment, nucleic acid molecules, which advantageously can 
be used in the process of the invention orginate from yeast, for example from 
35 Saccharomycetaceae, particularly from the genus Saccharomyces advantageously 
form the species Saccharomyces cerevisiae. 

[0214.0.0.0] If, in the process according to the invention, plants are selected as the 
donor organism, this plant may, in principle, be in any phylogenetic relation of the 
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recipient plant. Donor and recipient plant may belong to the same family, genus, 
species, variety or line, resulting in an increasing homology between the nucleic acids 
to be integrated and corresponding parts of the genome of the recipient plant. Th.s also 
applies analogously to microorganisms as donor and recipient organism. 
5 it might also be advantageously to use nuclei acids molecules from very distinct 
species, since these might exhibit reduced sensitivity against endogenous regulatory 
mechanisms and such sequences might not be recognized by endogenous snenang 
mechanisms. 

[021 5.0.0.0] Accordingly, one embodiment of the application relates to the use of 
10 nucleic acid molecules in the process of the invention from plants, e.g. crop plants, e.g. 
from- B. napus; O. sativa, Glycine max; B. vulgaris, L. japonicus, Z. elegans, Z. mays, 
C. arietinum, A. thaliana. H. vulgare, N. tabacum, G. hirsutum, P. patens, F. distichus, 
sunflower linseed or maize or their homologues. 

[0216.0.0.0] Accordingly, in one embodiment, the invention relates to a nucleic acid 
15 molecule, which comprises a nucleic acid molecule selected from the group consisting 
of: 

a) nucleic acid molecule encoding of the polypeptide as depicted in SEQ ID NO: 2, 
4 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
56 58, 60. 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82. 84. 86. 88. 90, 92, 94, 96, 

20 98. 100. 102. 104, 106, 108, 110. 112. 114, 116. 118, 120, 122, 124, 126, 128. 

130 132 134, 136, 138, 140, 142, 144. 146, 148, 150, 152, 154, 156, 158, 160, 
162 164 166, 168, 170. 172. 174, 176, 178. 180. 182. 184. 186, 188, 190, 192, 
194 196, 198. 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 
226 228 230, 232. 234. 236, 238. 240. 242, 244, 246, 248, 250. 252. 254. 256, 

25 258', 260, 262, 264, 266, 268. 270. 272. 274. 276, 278, 280. 282, 284. 286. 288. 

290 292 294. 296. 298. 300, 302, 304, 306. 308, 310, 312, 314, 316, 318, 320, 
322* 324 326, 328, 330, 332, 334,. 336, 338. 340, 342, 344, 346, 348, 350. 352. 
354'. 356. 358, 360. 362. 364. 366. 368. 370. 372. 374. 376. 378, 380. 382, 384, 
386, 388, 390, 392 or 394or a fragment thereof, which confers an increase in the 

30 amount of fine chemical in an organism or a part thereof; 

b) nucleic acid molecule comprising of the nucleic acid molecule as depicted in SEQ 
ID NO: 1, 3, 5. 7. 9. 11. 13. 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37. 39, 41 
43 45, 55. 57, 59, 61. 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 
93 95 97.99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 
35 127 129. 131, 133. 135. 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 

159 161, 163, 165, 167, 169, 171, 173, 175. 177. 179. 181. 183, 185, 187, 189, 
191 193, 195, 197, 199, 201. 203, 205. 207. 209, 211. 213. 215. 217. 219. 221. 
223 225 227. 229, 231, 233. 235, 237, 239. 241, 243. 245. 247. 249, 251, 253, 
255* 257*. 259, 261. 263. 265. 267. 269, 271. 273. 275. 277, 279, 281, 283, 285, 
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287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 
319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 
383, 385, 387, 389, 391 or 393 or a fragment thereof, which confers an increase 
5 in the amount of fine chemical in an organism or a part thereof; 

c) nucleic acid molecule whose sequence can be deduced from a polypeptide 
sequence encoded by a nucleic acid molecule of (a) or (b) as a result of the 
degeneracy of the genetic code and conferring an increase in the amount of fine 
chemical in an organism or a part thereof; 

10 d) nucleic acid molecule which encodes a polypeptide which has at least 50% 

identity with the amino acid sequence of the polypeptide encoded by the nucleic 
acid molecule of (a) to (c) and conferring an increase in the amount of fine 
chemical in an organism or a part thereof; 

e) nucleic acid molecule which hybridizes with a nucleic acid molecule of (a) to (c) 
15 under stringent hybridization conditions and conferring an increase in the amount 

of fine chemical in an organism or a part thereof; 

f) nucleic acid molecule encoding a polypeptide, the polypeptide being derived by 
substituting, deleting and/or adding one or more amino acids of the amino acid 
sequence of the polypeptide encoded by the nucleic acid molecules (a) to (d), 

20 preferably to (a) to (c), and conferring an increase in the amount of the fine 

chemical in an organism or a part thereof; 

g) nucleic acid molecule encoding a fragment or an epitope of a polypeptide which 
is encoded by one of the nucleic acid molecules of (a) to (e) , preferably to (a) to 
(c) and conferring an increase in the amount of the fine chemical in an organism 

25 or a part thereof ; 

h) nucleic acid molecule which encompasses a nucleic add molecule which is 
obtained by amplifying nucleic acid molecules from a cDNA library or a genomic 
library using the primers in SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 395 or 
SEQ ID NO: 396 and conferring an increase in the amount of the fine chemical in 

30 an organism or a part thereof; 

i) nucleic acid molecule encoding a polypeptide which is isolated, e.g. from a 
expression library, with the aid of monoclonal and/or polyclonal antibodies 
against a polypeptide encoded by one of the nucleic acid molecules of (a) to (g), 
preferably to (a) to (c) and conferring an increase in the amount of the fine 

35 chemical in an organism or a part thereof; 
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j) nucleic acid molecule encoding a polypeptide comprising the consensus 

sequence as depicted in SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ 
ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NQ: 397, SEQ ID NO: 398, 
SEQ ID NO: 399 and/or SEQ ID NO: 400 and conferring an increase in the 
5 amount of the fine chemical in an organism or a part thereof; and/or 

k) nucleic acid molecule which is obtainable by screening a suitable nucieic acid 

library under stringent hybridization conditions with a probe comprising one of the 
sequences of the nucleic acid molecule of (a) to (k) or with a fragment of at least 
15 nt, preferably 20 nt, 30 nt, 50 nt, 100 nt, 200 nt or 500 nt of the nucleic acid 
10 molecule characterized in (a) to (h) or of the nucleic acid molecule as depicted in 

SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25. 27, 29, 31, 33, 35, 37, 39, 
41 43, 45, 55. 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 
91, 93. 95, 97, 99, 101, 103. 105, 107. 109, 111, 113, 115, 117, 119, 121, 123, 
125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
15 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179. 181, 183, 185, 187, 

189, 191, 193. 195. 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 
221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 
253, 255, 257, 259. 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281=, 283, 
285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 
20 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341. 343, 345, 347, 

349, 351. 353. 355, 357. 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 
381, 383, 385, 387, 389, 391or 393 or a nucleic acid molecule encoding, 
preferably at least the mature form of, the polypeptide as depicted in SEQ ID NO: 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
25 56, 58, 60, 62, 64, 66, 68, 70. 72. 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 

98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 
130, 132, 134, 136, 138, 140. 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176. 178, 180, 182, 184, 186, 188. 190, 192, 
194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 
30 226, 228, 230, 232, 234, 236. 238. 240, 242, 244, 246, 248, 250, 252, 254, 256, 

258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 
290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316. 318, 320, 
322, 324, 326. 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 
354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 
35 386, 388, 390, 392 or 394, and conferring an increase in the amount of the fine 

chemical in an organism or a part thereof; 

or which encompasses a sequence which is complementary thereto; 

whereby, preferably, the nucleic acid molecule according to (a) to (k) distinguishes over 
the sequence as depicted in SEQ ID NO: 1, 3. 5, 7, 9, 11, 13, 15, 17. 19, 21, 23, 25, 
40 27, 29, 31, 33. 35, 37. 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 
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81. 83, 85, 87, 89, 91, 93, 95, 97, 99, 101. 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 
189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 
5 223, 225, 227, 229, 231 , 233, 235, 237, 239, 241 , 243, 245, 247, 249, 251 , 253, 255, 
257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 
325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391 or 
1 0 393 by one or more nucleotides. In one embodiment, the nucleic acid molecule of the 
invention does not consist of the sequence as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
15 143. 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201. 203, 205, 207, 209, 
211. 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241. 243, 
245, 247, 249, 251, 253, 255, 257, 259, 261. 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 31 V, 
20 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339. 341, 343, 345, 
347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 
381, 383, 385, 387, 389, 391or 393. In an other embodiment, the nucleic acid molecule 
of the present invention is at least 30 % identical and less than 100%, 99,999%, 
99,99%, 99,9% or 99% identical to the sequence as depicted in SEQ ID NO: 1. 3, 5, 7, 
25 9, 11, 13, 15, 17. 19, 21. 23, 25, 27. 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 
63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 
175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 
30 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 
243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 
277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303. 305, 307, 309, 
311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 
345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 
35 379, 381 , 383, 385, 387, 389, 391or 393. In a further embodiment the nucleic acid 

molecule does not encode the polypeptide sequence as depicted in SEQ ID NO: 2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 
62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
40 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 
208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 



WO 2005/014828 PCT/EP2004/008136 

105 

242, 244, 246, 248, 250. 252, 254, 256, 258, 260. 262, 264, 266, 268, 270, 272, 274, 
276, 278. 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300. 302, 304. 306, 308, 
31o', 312*. 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342. 
344, 346. 348, 350, 352, 354. 356. 358. 360. 362. 364, 366. 368, 370, 372, 374, 376, 
5 378, 380. 382, 384, 386, 388, 390, 392 or 394. Accordingly, in one embodiment, the 
, ~f th a nraeant in\/«ntion finmdes in one embodiment a 

I luuido aoiu 1 1 iuicouiC ui u iw **ww. >k .... . - 

polypeptide which differs at least in one or more amino acids from the polypeptide as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32. 34, 36, 
38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70. 72, 74, 76, 78, 80. 82, 84, 86, 88, 90, 
10 92, 94. 96, 98, 100. 102, 104, 106, 108. 110, 112, 114, 116. 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, .152, 154, 156, 158, 160, 
162*, 164, 166, 168, 170, 172, 174, 176. 178, 180, 182, 184, 186, 188, 190. 192. 194. 
196, 198. 200, 202, 204, 206, 208, 210, 212, 214, 216, 218. 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242. 244, 246, 248, 250, 252, 254, 256, 258. 260, 262, 
1 5 264, 266*, 268. 270. 272, 274, 276. 278, 280, 282, 284, 286, 288, 290, 292. 294, 296. 
298'. 300, 302, 304, 306. 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364. 
366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. In 
another embodiment, the nucleic acid molecule as depicted in SEQ ID NO: 1, 3, 5, 7, 9, 
20 11, 13, 15, 17, 19, 21, 23, 25, 27. 29, 31. 33. 35. 37. 39, 41 43. 45, 55, 57. 59, 61, 63, 
65, 67, 69. 71. 73, 75, 77, 79, 81. 83, 85, 87, 89, 91, 93, 95, 97. 99, 101. 103. 105. 107. 
109. 111. 113. 115. 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137. 139. 141, 
143, 145, 147, 149. 151. 153, 155, 157, 159, 161, 163, 165, 167. 169. 171. 173. 175. 
177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 
25 21l! 213', 215, 217, 219. 221. 223. 225, 227, 229, 231, 233, 235, 237, 239. 241, 243, 
245, 247, 249, 251, 253. 255, 257, 259, 261, 263. 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311. 
313*, 315. 317, 319, 321, 323, 325, 327, 329. 331. 333, 335. 337, 339, 341, 343, 345, 
347, 349. 351. 353. 355. 357. 359, 361, 363, 365, 367, 369, 371, 373, 375,-377, 379, 
30 381 383, 385, 387, 389, 391 or 393 does not encode a protein of the sequence as 

depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36. 
38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92! 94, 96. 98. 100. 102. 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148. 150. 152, 154, 156, 158, 160, 
35 I62! 164, 166. 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210. 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236. 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264*, 266', 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
29s', 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
40 332] 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. 
Accordingly, in one embodiment, the protein encoded by a sequences of a nucleic acic 
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accoriding to (a) to (k) does not consist of the sequence as depicted in SEQ ID NO: 2, 
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 
5 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180. 182, 184, 186. 188, 190, 192, 194, 196, 198, 200, 202, 204, 
206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 
240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 
274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306. 

10 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 
342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372. 374. 
376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. in a further embodiment, the 
protein of the present invention is at least 30 % identical to protein sequence as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32. 34, 36, 

15 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152. 154. 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 

20 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252. 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 and less 

25 than 100%, preferably less than 99,999%, 99,99% or 99,9%, more preferably less than 
99%, 98%, 97%, 96% or 95% identical to the sequence as depicted in SEQ ID NO: 2, 
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124. 126. 128, 130, 132, 134, 136, 

30 138, 140, 142, 144, 146. 148, 150. 152. 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176. 178, 180, 182. 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 
206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238. 
240, 242, 244, 246, 248, 250, 252. 254. 256, 258, 260, 262, 264, 266, 268, 270, 272, 
274, 276, 278, 280, 282, 284, 286, 288, 290, 292. 294. 296. 298, 300, 302, 304, 306, 

35 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 
342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 
376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. 

[0217.0.0.0] The nucleic acid sequences used in the process are advantageously 
introduced in a nucleic acid construct, preferably an expression cassette, which makes 
40 " the expression of the nucleic acid molecules in an organism, advantageously a plant or 
a microorganism possible. 
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[0218.0.0.0] Accordingly, the invention also relates to a nucleic acid construct, 
preferably to an expression construct, comprising the nucleic acid molecule of the 
present invention functionally linked to one or more regulatory elements or signals. 

[0219.0.0.0] As described herein, the nucleic acid construct can also comprise 
5 further genes, which are introduced into the organisms or cells. It is possible and 
advantageous to introduce into, and express in, the host organisms regulatory genes 
such as genes for inductors, repressors or enzymes, which, owing to their enzymatic 
activity, engage in the regulation of one or more genes of a biosynthetic pathway. 
These genes can be of heterologous or homologous origin. Moreover, further 
10 biosynthesis genes may advantageously be present, or else these genes may be 
located on one or more further nucleic acid constructs. Genes, which are 
advantageously employed as biosynthesis genes are genes of the amino acid 
metabolism, of glycolysis, of the tricarboxylic acid metabolism or their combinations. As 
described herein, regulator sequences or factors can have a positive effect on 
1 5 preferably the gene expression of the genes introduced, thus increasing it Thus, an 
enhancement of the regulator elements may advantageously take place at the 
transcriptional level by using strong transcription signals such as promoters and/or 
enhancers. In addition, however, an enhancement of translation is also possible, for 
example by increasing mRNA stability or by inserting a translation enhancer sequence. 

20 [0220.0.0.0] In principle, the nucleic acid construct can comprise the herein 

described regulator sequences and further sequences relevant for the expression of 
the comprised genes. Thus, the nucleic acid construct of the invention can be used as 
expression cassette and thus can be used directly for introduction into the plant, or else 
they may be introduced into a vector. Accordingly in one embodiment the nucleic acid 

25 construct is an expression cassette comprising a microorganism promoter or a 

microorganism terminator or both. In another embodiment the expression cassette 
encompasses a plant promoter or a plant terminator or both. 

[0221 .0.0.0] Accordingly, in one embodiment, the process according to the invention 
comprises the following steps: 

30 (a) introducing of a nucleic acid construct comprising the nucleic acid molecule of 

the invention or used in the process of the invention or encoding the polypeptide 

of the present invention or used in the process of the invention; or 

(b) introducing of a nucleic acid molecule, including regulatory sequences or factors, 
which expression increases the expression of the nucleic acid molecule of the 
35 invention or used in the process of the invention or encoding the polypeptide of 

the present invention or used in the process of the invention in a cell, or an 
organism or a part thereof, preferably in a plant, plant cell or a microorganism, 
and 
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(c) expressing of the gene product encoded by the nucleic acid construct or the 
nucleic acid molecule mentioned under (a) or (b) in the cell or the organism. 

[0222.0.0.0] After the introduction and expression of the nucleic acid construct the 
transgenic organism or cell is advantageously cultured and subsequently harvested. 
5 The transgenic organism or cell may be a prokaryotic or eukaryotic organism such as a 
microorganism, a noh-human animal and plant for example a plant or animal cell, a 
plant or animal tissue, preferably a crop plant, or a part thereof. 

[0223.0.0.0] To introduce a nucleic acid molecule into a nucleic acid construct, e.g. 
as part of an expression cassette, the codogenic gene segment is advantageously 

10 subjected to an amplification and ligation reaction in the manner known by a skilled 
person. It is preferred to follow a procedure similar to the protocol for the Pfu DNA 
polymerase or a Pfu/Taq DNA polymerase mixture. The primers are selected according 
to the sequence to be amplified. The primers should expediently be chosen in such a 
way that the amplificate comprise the codogenic sequence from the start to the stop 

1 5 codon. After the amplification, the amplificate is expediently analyzed. For example, the 
analysis may consider quality and quantity and be carried out following separation by 
gel electrophoresis. Thereafter, the amplificate can be purified following a standard 
protocol (for example Qiagen). An aliquot of the purified amplificate is then available for 
the subsequent cloning step. The skilled worker generally knows suitable cloning 

20 vectors. 

[0224.0.0.0] They include, in particular, vectors which are capable of replication in 
easy to handle cloning systems like as bacterial yeast or insect cell based (e.g. 
baculovirus expression) systems, that is to say especially vectors which ensure 
efficient cloning in E. coli, and which make possible the stable transformation of plants. 
25 Vectors, which must be mentioned, in particular are various binary and cointegrated 

vector systems, which are suitable for the T-DNA-mediated transformation. Such vector 
systems are generally characterized in that they contain at least the vir genes, which 
are required for the Agrobacterium-mediated transformation, and the T-DNA border 
sequences. 

30 [0225.0.0.0] In general, vector systems preferably also comprise further cis- 

regulatory regions such as promoters and terminators and/or selection markers by 
means of which suitably transformed organisms can be identified. While vir genes and 
T-DNA sequences are located on the same vector in the case of cointegrated vector 
systems, binary systems are based on at least two vectors, one of which bears vir 

35 genes, but no T-DNA, while a second one bears T-DNA, but no vir gene. Owing to this 
fact, the last-mentioned vectors are relatively small, easy to manipulate and capable of 
replication in E. coli and in Agrobacterium. These binary vectors include vectors from 
the series pBIB-HYG, pPZP, pBecks, pGreen. Those, which are preferably used in 
accordance with the invention, are Bin19, pBI101, pBinAR, pGPTVand pCAMBIA. An 
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overview of binary vectors and their use is given by Hellens et al, Trends in Plant 
Science (2000) 5, 446-451 . 

[0226.0.0.0] For a vector preparation, vectors may first be linearized using restriction 
endonuclease(s) and then be modified enzymatically in a suitable manner. Thereafter, 
5 the vector is purified, and an aliquot is employed in the cloning step. In the cloning 
step, the enzyme-cleaved and, if required, purified amplificate is cloned together with 
similarly prepared vector fragments, using ligase. In this context, a specific nucleic acid 
construct, or vector or plasmid construct, may have one or else more codogenic gene 
segments. The codogenic gene segments in these constructs are preferably linked 
1 0 operably to regulatory sequences. The regulatory sequences include, in particular, 
plant sequences like the above-described promoters and terminators. The constructs 
can advantageously be propagated stably in microorganisms, in particular Escherichia 
coll and/or Agrobacterium tumefaciens, under selective conditions and enable the 
transfer of heterologous DNA into plants or other microorganisms. In accordance with a 
1 5 particular embodiment, the constructs are based on binary vectors (overview of a 
binary vector Hellens et al., 2000). As a rule, they contain prokaryotic regulatory 
sequences, such as replication origin and selection markers, for the multiplication in 
microorganisms such as Escherichia coll and Agrobacterium tumefaciens. Vectors can 
further contain agrobacterial T-DNA sequences for the transfer of DNA into plant 
20 genomes or other eukaryotic regulatory sequences for transfer into other eukaryotic 
cells, e.g. Saccharomyces sp. or other prokaryotic regulatory sequences for the 
transfer into other prokaryotic cells, e.g. Corynebacterium sp. or Bacillus sp. For the 
transformation of plants, the right border sequence, which comprises approximately 25 
base pairs, of the total agrobacterial T-DNA sequence is advantageously included. 
25 Usually, the plant transformation vector constructs according to the invention contain 
T-DNA sequences both from the right and from the left border region, which contain 
expedient recognition sites for site-specific acting enzymes, which, In turn, are encoded 
by some of the vir genes. 

[0227.0.0.0] Suitable host organisms are known to the skilled worker. Advantageous 
30 organisms are described further above in the present application. They include in 
particular eukaryotes or eubacteria, e.g. prokaryotes or archae bacteria. 
Advantageously host organisms are microorganisms selected from the group 
consisting of Actinomycetaceae, Bacillaceae, Brevibacteriaceae, Corynebacteriaceae, 
Enterobacteriacae, Gordoniaceae, Micrococcaceae, Mycobacteriaceae, Nocardiaceae, 
35 Pseudomonaceae, Rhizobiaceae, Streptomycetaceae, Chaetomiaceae, 
Choanephoraceae, Cryptococcaceae, Cunninghamellaceae, Demetiaceae, 
Moniliaceae, Mortierellaceae, Mucoraceae, Pythiaceae, Saccharomycetaceae, 
Saprolegniaceae, Schizosaccharomycetaceae, Sodariaceae, Sporobolomycetaceae, 
Tuberculariaceae. Adelotheciaceae, Dinophyceae, Ditrichaceae and Prasinophyceae. 
40 Preferably are unicellular, microorganisms, e.g. fungi, bacteria or protoza, such as 
fungi like the genus Claviceps or Aspergillus or gram-positive bacteria such as the 
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genera Bacillus, Corynebacterium, Micrococcus, Brevibacterium, Rhodococcus, 
Nocardia, Caseobacter or Arthrobacter or gram-negative bacteria such as the genera 
Escherichia, Flavobacterium or Salmonella, or yeasts such as the genera Rhodotorula, 
Hansenula, Pichia, Yerrowia, Saccharomyces, Schizosaccharomyces or Candida. 

5 [0228.0.0,0] Host organisms which are especially advantageously selected in the 
process according to the invention are microorganisms selected from the group of the 
genera and species consisting of Hansenula anomala, Saccharomyces cerevisiae, 
Candida utiiis, Claviceps purpurea, Bacillus circulans, Bacillus subtilis, Bacillus sp., 
Brevibacterium albidum, Brevibacterium album, Brevibacterium cerinum, 

10 Brevibacterium flavum, Brevibacterium glutamigenes, Brevibacterium iodinum, Brevi- 
bacterium ketoglutamicum, Brevibacterium lactofermentum, Brevibacterium linens, 
Brevibacterium roseum, Brevibacterium saccharolyticum, Brevibacterium sp., Coryne- 
bacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium 
ammoniagenes, Corynebacterium glutamicum (= Micrococcus glutamicum), Coryne- 

15 bacterium melassecola, Corynebacterium sp. or Escherichia coli, specifically 
Saccharomyces cerevisiae or Escherichia coli K12 and its described strains. 

[0229.0.0.0] Advantageously preferred in accordance with the invention are host 
organisms of the genus Agrobacterium tumefaciens or plants. Preferred plants are 
selected from among the families Aceraceae, Anacardiaceae, Apiaceae, Asteraceae, 

20 Apiaceae, Betulaceae, Boraginaceae, Brassicaceae, Bromeliaceae, Cactaceae, 
Caricaceae, Caryophyllaceae, Cannabaceae, Convolvulaceae, Chenopodiaceae, 
Elaeagnaceae, Geraniaceae, Gramineae, Juglandaceae, Lauraceae, Leguminosae, 
Linaceae, Cucurbitaceae, Cyperaceae, Euphorbiaceae, Fabaceae, Malvaceae, 
Nymphaeaceae, Papaveraceae, Rosaceae, Salicaceae, Solanaceae, Arecaceae, 

25 Iridaceae, Liliaceae, Orchidaceae, Gentianaceae, Labiaceae, Magnoliaceae, 
Ranunculaceae, Carifolaceae, Rubiaceae, Scrophulariaceae, Ericaceae, 
Polygonaceae, Violaceae, Juncaceae, Poaceae, perennial grass, fodder crops, 
vegetables and ornamentals. 

[0230.0.0.0] Especially preferred are plants selected from the groups of the families 
30 Apiaceae, Asteraceae, Brassicaceae, Cucurbitaceae, Fabaceae, Papaveraceae, 
Rosaceae, Solanaceae, Liliaceae or Poaceae. Especially advantageous are, in 
particular, crop plants. Accordingly, an advantageous plant preferably belongs to the 
group of the genus peanut, oilseed rape, canola, sunflower, safflower, olive, sesame, 
hazelnut, almond, avocado, bay, pumpkin/squash, linseed, soya, pistachio, borage, 
35 maize, wheat, rye, oats, sorghum and millet, triticale, rice, barley, cassava, potato, 
sugarbeet, fodder beet, egg. plant, and perennial grasses and forage plants, oil palm, 
vegetables (brassicas, root vegetables, tuber vegetables, pod vegetables, fruiting 
vegetables, onion vegetables, leafy vegetables and stem vegetables), buckwheat, 
Jerusalem artichoke, broad bean, vetches, lentil, alfalfa, dwarf bean, lupin, clover and 
40 lucerne. 
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[0231.0.0.0] In order to introduce, into a plant, the nucleic acid molecule of the 
invention or used in the process according to the invention, it has proved advantageous 
first to transfer them into an intermediate host, for example a bacterium or a eukaryotic 
unicellular cell. The transformation into E. coli, which can be carried out in a manner 

5 known per se, for example by means of heat shock or electroporation, has proved itself 
expedient in this context. Thus, the transformed E. coli colonies can be analysed for 
their cloning efficiency. This can be carried out with the aid of a PCR. Here, not only 
the identity, but also the integrity, of the plasmid construct can be verified with the aid 
of a defined colony number by subjecting an aliquot of the colonies to said PCR. As a 

1 0 rule, universal primers which are derived from vector sequences are used for this 

purpose, it being possible, for example, for a forward primer to be arranged upstream 
of the start ATG and a reverse primer to be arranged downstream of the stop codon of 
the codogenic gene segment. The amplificates are separated by electrophoresis and 
assessed with regard to quantity and quality. 

1 5 [0232.0.0.0] The nucleic acid constructs, which are optionally verified, are 

subsequently used for the transformation of the plants or other hosts, e.g. other 
eukaryotic cells or other prokaryotic cells. To this end, it may first be necessary to 
obtain the constructs from the intermediate host. For example, the constructs may be 
obtained as plasmids from bacterial hosts by a method similar to conventional plasmid 

20 isolation. 

[0233.0.0.0] The nucleic acid molecule of the invention or used in the process 
according to the invention can also be introduced into modified viral vectors like 
baculovirus vectors for expression in insect cells or plant viral vectors like tobacco 
mosaic virus or potato virus X-based vectors. Approaches leading to the expression of 

25 proteins from the modified viral genome including the the nucleic acid molecule of the 
invention or used in the process according to the invention involve for example the 
inoculation of tobacco plants with infectious RNA transcribed in vitro from a cDNA copy 
of the recombinant viral genome. Another approach utilizes the transfection of whole 
plants from wounds inoculated with Agrobacterium tumefaciens containing cDNA 

30 copies of recombinant plus-sense RNA viruses. Different vectors and virus are known 
to the skilled worker for expression in different target eg. production plants. 

[0234.0.0.0] A large number of methods for the transformation of plants are known. 
Since, in accordance with the invention, a stable integration of heterologous DNA into 
the genome of plants is advantageous, the T-DNA-mediated transformation has proved 

35 expedient in particular. For this purpose, it is first necessary to transform suitable 
vehicles, in particular agrobacteria, with a codogenic gene segment or the 
corresponding plasmid construct comprising the nucleic acid molecule of the invention. 
This can be carried out in a manner known per se. For example, said nucleic acid 
construct of the invention, or said expression construct or said plasmid construct, which 

40 has been generated in accordance with what has been detailed above, can be 
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transformed into competent agrobacteria by means of electroporation or heat shock In 
principle, one must differentiate between the formation of cointegrated vectors on the 
one hand and the transformation with binary vectors on the other hand. In the case of 
the first alternative, the constructs, which comprise the codogenic gene segment or the 
5 nucleic acid molecule of the invention have no T-DNA sequences, but the formation of 
the cointegrated vectors or constructs takes place in the agrobacteria by homologous 
recombination of the construct with T-DNA. The T-DNA is present in the agrobacteria in 
the form of Ti or Ri plasmids in which exogenous DNA has expediently replaced the 
oncogenes. If binary vectors are used, they can be transferred to agrobacteria either by 
10 bacterial conjugation or by direct transfer. These agrobacteria expediently already 
comprise the vector bearing the vir genes (currently referred to as helper Ti(Ri). 
plasmid). 

[0235.0.0.0] One or more markers may expediently also be used together with the 
nucleic acid construct, or the vector of the invention and, if plants or plant cells shall be 

1 5 transformed together with the T-DNA, with the aid of which the isolation or selection of 
transformed organisms, such as agrobacteria or transformed plant cells, is possible. 
These marker genes enable the identification of a successful transfer of the nucleic 
acid molecules according to the invention via a series of different principles, for 
example via visual identification with the aid of fluorescence, luminescence or in the 

20 wavelength range of light which is discernible for the human eye, by a resistance to 
herbicides or antibiotics, via what are known as nutritive markers (auxotrophism 
markers) or antinutritive markers, via enzyme assays or vja phytohormones. Examples 
of such markers which may be mentioned are GFP (= green fluorescent protein); the 
luciferin/luceferase system, the (3-galactosidase with its colored substrates, for example 

25 X-Gal, the herbicide resistances to, for example, imidazolinone, glyphosate, 

phosphinothricin or sulfonylurea, the antibiotic resistances to, for example, bleomycin, 
hygromycin, streptomycin, kanamycin, tetracyclin, chloramphenicol, ampicillin, 
gentamycin, geneticin (G418), spectinomycin or blasticidin, to mention only a few, 
nutritive markers such as the utilization of mannose or xylose, or antinutritive markers 

30 such as the resistance to 2-deoxyglucose. This list is a small number of possible 

markers. The skilled worker is very familiar with such markers. Different markers are 
preferred, depending on the organism and the selection method. 

[0236.0.0.0] As a rule, it is desired that the plant nucleic acid constructs are flanked 
by T-DNA at one or both sides of the codogenic gene segment. This is particularly 

35 useful when bacteria of the species Agrobacterium tumefaciens or Agrobacterium 

rhizogenes are used for the transformation. A method, which is preferred in accordance 
with the invention, is the transformation with the aid of Agrobacterium tumefaciens. 
However, biolistic methods may also be used advantageously for introducing the 
sequences in the process according to the invention, and the introduction by means of 

40 PEG is also possible. The transformed agrobacteria can be grown in the manner 

known per se and are thus available for the expedient transformation of the plants. The 
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plants or plant parts to be transformed are grown or provided in the customary manner. 
The transformed agrobacteria are subsequently allowed to act on the plants or plant 
parts until a sufficient transformation rate is reached. Allowing the agrobacteria to act 
on the plants or plant parts can take different forms. For example, a culture of 
5 morphogenic plant cells or tissue may be used. After the T-DNA transfer, antibiotics as 
a rule eliminate the bacteria, and the regeneration of plant tissue is induced. This is 
done in particular using suitable plant hormones in order to initially induce callus 
formation and then to promote shoot development. 

[0237.0.0.0] The transfer of foreign genes into the genome of a plant is called 

10 transformation. In doing this the methods described for the transformation and 

regeneration of plants from plant tissues or plant cells are utilized for transient or stable 
transformation. An advantageous transformation method is the transformation in 
planta. To this end, it is possible, for example, to allow the agrobacteria to act on plant 
seeds or to inoculate the plant meristem with agrobacteria. It has proved particularly 

15 expedient in accordance with the invention to allow a suspension of transformed 
agrobacteria to act on the intact plant or at least the flower primordia. The plant is 
subsequently grown on until the seeds of the treated plant are obtained (Clough and 
Bent, Plant J. (1998) 16, 735-743). To select transformed plants, the plant material 
obtained in the transformation is, as a rule, subjected to selective conditions so that 

20 transformed plants can be distinguished from untransformed plants. For example, the 
seeds obtained in the above-described manner can be planted and, after an initial 
growing period, subjected to a suitable selection by spraying. A further possibility 
consists in growing the seeds, if appropriate after sterilization, on agar plates using a 
suitable selection agent so that only the transformed seeds can grow into plants. 

25 Further advantageous transformation methods, in particular for plants, are known to the 
skilled worker and are described hereinbelow. 

[0238.0.0.0] Further advantageous and suitable methods are protoplast 
transformation by poly (ethylene glycol)-induced DNA uptake, the „biolistic tt method 
using the gene cannon - referred to as the particle bombardment method, 

30 electroporation, the incubation of dry embryos in DNA solution, microinjection and gene 
transfer mediated by Agrobacterium. Said methods are described by way of example in 
B. Jenes et al., Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, 
Engineering and Utilization, eds. S.D. Kung and R. Wu, Academic Press (1993) 128- 
143 and in Potrykus Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991) 205-225). 

35 The nucleic acids or the construct to be expressed is preferably cloned into a vector, 
which is suitable for transforming Agrobacterium tumefaciens, for example pBin19 
(Bevan et al., Nucl. Acids Res. 12 (1984) 871 1). Agrobacteria transformed by such a 
vector can then be used in known manner for the transformation of plants, in particular 
of crop plants such as by way of example tobacco plants, for example by bathing 

40 bruised leaves or chopped leaves in an agrobacterial solution and then culturing them 
in suitable media. The transformation of plants by means of Agrobacterium 
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tumefaciens is described, for example, by Hofgen and Willmitzer in Nucl. Acid Res. 
(1988) 16, 9877 or is known inter alia from F.F. White, Vectors for Gene Transfer in 
Higher Plants; in Transgenic Plants, Vol. 1, Engineering and Utilization, eds. S.D. Kung. 
and R. Wu, Academic Press, 1993, pp. 15-38. 

5 [0239.0.0.0] The abovementioned nucleic acid molecules can be cloned into the 
nucleic acid constructs or vectors according to the invention in combination together 
with further genes, or else different genes are introduced by transforming several 
nucleic acid constructs or vectors (including plasmids) into a host cell, advantageously 
into a plant cell or a microorgansims. 

10 [0240.0.0.0] In addition to the sequence mentioned in SEQ ID NO: 1, 3, 5, 7, 9, 11, 
13, 15. 17, 19, 21, 23, 25, 27, 29, 31. 33, 35, 37, 39, 41 43, 45, 55, 57, 59. 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79. 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121. 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145. 147, 149, 151, 153, 155, 157. 159. 161, 163, 165. 167, 169, 171, 173, 175, 

15 177. 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209 ; 
211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 
245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 
279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 
313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 

20 347, 349, 351 , 353, 355, 357, 359, 361 , 363, 365, 367, 369, 371 , 373, 375, 377, 379, 
381 , 383, 385, 387, 389, 391 or 393 or its derivatives, it is advantageous additionally to 
express and/or mutate further genes in the organisms. Especially advantageously, 
additionally at least one further gene of the fine chemical biosynthetic pathway e.g. the 
amino acid biosynthetic pathway such as for L-tryptophane, L-isoleucine, L-leuc'ine, 

25 L-lysine, L-threonine and/or L-methionine to mention only a couple of them is 

expressed in the organisms such as plants or microorganisms. It is also possible that 
the regulation of the natural genes has been modified advantageously so that the gene 
and/or its gene product is no longer subject to the regulatory mechanisms which exist 
in the organisms. This leads to an increased synthesis of the fine chemicals e.g. the 

30 amino acids desired since, for example, feedback regulations no longer exist to the 
same extent or not at all. In addition it might be advantageously to combine the 
sequences shown in SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13, 15. 17, 19, 21, 23, 25, 27, 29, 
31, 33, 35. 37. 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 
85, 87, 89. 91, 93. 95. 97, 99, 101, 103, 105, 107, 109, 111, 113, 115. 117. 119, 121, 

35 123. 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 
191, 193, 195, 197, 199, 201. 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 
225, 227. 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 
259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291. 

40 293. 295, 297. 299. 301, 303, 305, 307, 309. 31 1, 313, 315, 317, 319, 321, 323, 325, 
327, 329, 331, 333, 335, 337. 339, 341, 343, 345, 347, 349, 351. 353, 355, 357, 359, 



WO 2005/014828 PCT/EP2004/008136 

115 

361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391or 393 
with genes which generally support or enhances to growth or yield of the target 
organismen, for example genes which lead to faster growth rate of microorganisms or 
genes which produces stress-, pathogen, or herbicide resistant plants. 

5 

[0241 ,0.0.0] In a further embodiment of the process of the invention, therefore, 
organisms are grown, in which there is simultaneous overexpression of at least one 
nucleic acid or one of the genes which code for proteins involved in the fine chemical 
metabolism e.g. the amino acid metabolism, in particular in amino acid synthesis. 

1 0 [0242.0.0.0] A further advantageous nucleic acid sequence which can be expressed 
in combination with the sequences used in the process and/or the abovementioned 
biosynthesis genes is the sequence of the ATP/ADP translocator as described in 
WO 01/20009. This ATP/ADP translocator leads to an increased synthesis of the 
essential amino acids lysine and/or methionine. Furthermore, an advantageous nucleic 

15 acid sequence coexpressed can be threonine adlolase and/or lysine decarboxylase as 
described in the state of the art. 

[0243.0.0.0] In a further advantageous embodiment of the process of the invention, 
the organisms used in the process are those in which simultaneously at least one of 
the aforementioned genes or one of the aforementioned nucleic acids is mutated so 

20 that the activity of the corresponding proteins is influenced by metabolites to a smaller 
extent compared with the unmutated proteins, or not at all, and that in particular the 
production according to the invention of the fine chemical for example of the amino 
acids is not impaired, or so that their specific enzymatic activity is increased. Less 
influence means in this connection that the regulation of the enzymic activity is less by 

25 at least 1 0%, advantageously at least 20, 30 or 40%, particularly advantageously by at 
least 50, 60 or 70%, compared with the starting organism, and thus the activity of the 
enzyme'is increased by these figures mentioned compared with the starting organism. 
An increase in the enzymatic activity means an enzymatic activity which is increased 
by at least 10%, advantageously at least 20, 30 or 40%, particularly advantageously by 

30 at least 50, 60 or 70%, compared with the starting organism. This leads to an increased 
productivity of the fine chemical. 

[0244.0.0.0] In a further advantageous embodiment of the process of the invention, 
the organisms used in the process are those in which simultaneously the fine chemical 
degrading protein is attenuated, in particular by reducing the rate of expression of the 
35 corresponding gene. 

[0245.0.0.0] In another embodiment of the process of the invention, the organisms 
used in the process are those in which simultaneously at least one of the 
aforementioned nucleic acids or of the aforementioned genes is mutated in such a way 
that the enzymatic activity of the corresponding protein is partially reduced or 
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completely blocked. A reduction in the enzymatic activity means an enzymatic activity, 
which is reduced by at least 10%, advantageously at least 20, 30 or 40%, particularly 
advantageously by at least 50, 60 or 70%, preferably more, compared with the starting 
organism. 

5 [0246.0.0.0] If it is intended to transform the host cell, in particular the plant cell, with 
several constructs or vectors, the marker of a preceding transformation must be 
removed or a further marker employed in a following transformation/The markers can 
be removed from the host cell, in particular the plant cell, as described hereinbelow via 
methods with which the skilled worker is familiar. In particular plants without a marker, 
10 in particular without resistance to antibiotics, are an especially preferred embodiment of 
the present invention. 

[0247.0.0.0] In the process according to the invention, the nucleic acid sequences 
used in the process according to the invention are advantageously linked operably to 
one or more regulatory signals in order to increase gene expression. These regulatory 

15 sequences are intended to enable the specific expression of the genes and the 
expression of protein. Depending on the host organism for example plant or 
microorganism, this may mean, for example, that the gene is expressed and/or 
overexpressed after induction only, or that it is expressed and/or overexpressed 
constitutively. These regulatory sequences are, for example, sequences to which the 

20 inductors or repressors bind and which thus regulate the expression of the nucleic acid. 
In addition to these novel regulatory sequences, or instead of these sequences, the 
natural regulation of these sequences may still be present before the actual structural 
genes and, if appropriate, may have been genetically modified so that the natural 
regulation has been switched off and gene expression has been increased. However, 

25 the nucleic acid construct of the invention suitable as expression cassette (= 

expression construct = gene construct) can also be simpler in construction, that is to 
say no additional regulatory signals have been inserted before the nucleic acid 
sequence or its derivatives, and the natural promoter together with its regulation has 
not been removed. Instead, the natural regulatory sequence has been mutated in such 

30 a way that regulation no longer takes place and/or gene expression is increased. 

These modified promoters can also be introduced on their own before the natural gene 
in the form of part sequences (= promoter with parts of the nucleic acid sequences 
according to the invention) in order to increase the activity. Moreover, the gene 
construct can advantageously also comprise one or more of what are known as 

35 enhancer sequences in operable linkage with the promoter, and these enable an 
increased expression of the nucleic acid sequence. Also, it is possible to insert 
additional advantageous sequences at the 3' end of the DNA sequences, such as, for 
example, further regulatory elements or terminators. 

[0248.0.0.0] The nucleic acid molecules, which encode proteins according to the 
40 invention and nucleic acid molecules, which encode other polypeptides may be present 
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in one nucleic acid construct or vector or in several ones. Advantageously, only one 
copy of the nucleic acid molecule of the invention or its encoding genes is present in 
the nucleic acid construct or vector. Several vectors or nucleic acid construct or vector 
can be expressed together in the host organism. The nucleic acid molecule or the 

5 nucleic acid construct or vector according to the invention can be inserted in a vector 
and be present in the ceil in a free form. If a stable transformation is preferred, a vector 
is used, which is stably duplicated over several generations or which is else be inserted 
into the genome. In the case of plants, integration into the plastid genome or, in 
particular, into the nuclear genome may have taken place. For the insertion of more 

1 0 than one gene in the host genome the genes to be expressed are present together in 
one gene construct, for example in above-described vectors bearing a plurality of 
genes. 

[0249.0.0.0] As a rule, regulatory sequences for the expression rate of a gene are 
located upstream (5'), within, and/or downstream (3') relative to to the coding sequence 
1 5 of the nucleic acid molecule of the invention or another codogenic gene segment. They 
control in particular transcription and/or translation and/or the transcript stability. The 
expression level is dependent on the conjunction of further cellular regulatory systems, 
such as the protein biosynthesis and degradation systems of the cell. 

[0250.0.0.0] Regulatory sequences include transcription and translation regulating 
20 sequences or signals, e.g. sequences located upstream (5'), which concern in 

particular the regulation of transcription or translation initiation, such as promoters or 
start codons, and sequences located downstream (3*), which concern in particular the 
regulation of transcription or translation termination and transcript stability, such as 
polyadenylation signals or stop codons. Regulatory sequences can also be present in 
25 transcribed coding regions as well in transcribed non-coding regions, e.g. in introns, as 
for example splicing sites. Promoters for the regulation of expression of the nucleic acid 
molecule according to the invention in a cell and which can be employed are. in 
principle, all those which are capable of stimulating the transcription of genes in the 
organisms in question, such as microorganisms or plants. Suitable promoters, which 
30 are functional in these organisms, are generally known. They may take the form of 
constitutive or inducible promoters. Suitable promoters can enable the development- 
and/or tissue-specific expression in multi-celled eukaryotes; thus, leaf-, root-, flower-, 
seed-, stomata-, tuber- or fruit-specific promoters may advantageously be used in 
plants. 

35 [0251 .0.0.0] The regulatory sequences or factors can, as described above, have a 
positive effect on, the expression of the genes introduced, thus increasing their 
expression. Thus, an enhancement of the expression can advantageously take place at 
the transcriptional level by using strong transcription signals such as strong promoters 
and/or strong enhancers. In addition, enhancement of expression on the translational 

40 level is also possible, for example by introducing translation enhancer sequences, e.g., 
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the Q enhancer e.g. improving the ribosomal binding to the transcript, or by increasing 
the stability of the mRNA, e.g. by replacing the 3'UTR coding region by a region 
encoding a 3'UTR known as conferring an high stability of the transcript or by 
stabilization of the transcript through the elimination of transcript instability, so that the 
5 mRNA molecule Is translated more often than the wild type. For example in plants AU- 
rich elements (AREs) and DST (downstream) elements destabilized transcripts. 
Mutagenesis studies have demonstrated that residues within two of the conserved 
domains, the ATAGAT and the GTA regions, are necessary for instability function. 
Therefore removal or mutation of such elements would obviously lead to more stable 
10 transcripts, higher transcript rates and higher protein acitivity.Translation enhancers are 
also the "overdrive sequence", which comprises the tobacco mosaic virus 5- 
untranslated leader sequence and which increases the protein/RNA ratio (Gallie et al., 
1 987, Nucl. Acids Research 1 5:8693-871 1 } 

[0252.0.0.0] Enhancers are generally defined as cis active elements, which can 
15 stimulate gene transcription independent of position and orientation. Different 
enhancers have been identified in plants, which can either stimulate transcription 
constitutively or tissue or stimuli specific. Well known examples for constitutive 
enhancers are the enhancer from the 35S promoter (Odell et al., 1985, Nature 
313:810-812) or the ocs enhancer (Fromm et al., 1989, Plant Cell 1: 977:984) Another 
20 examples are the G-Box motif tetramer which confers high-level constitutive expression 
in dicot and monocot plants (Ishige et al., 1999, Plant Journal, 18, 443-448) or the 
petE, a A/T-rich sequence which act as quantitative enhancers of gene expression in 
transgenic tobacco and potato plants (Sandhu et al., 1998; Plant Mol Biol. 37(5):885~ 
96). Beside that, a large variety of cis-active elements have been described which 
25 contribute to specific expression pattern, like organ specific expression or induced 
expression in response to biotic or abiotic stress. Examples are elements, which 
provide pathogen or wound-induced expression (Rushton,_2002, Plant Cell, 14, 749- 
762) or guard cell-specific expression (Plesch, 2001, Plant Journal 28, 455-464). 

[0253.0.0.0] Advantageous regulatory sequences for the expression of the nucleic 
30 acid molecule according to the invention in microorganisms are present for example in 
promoters such as the cos, tac, rha, trp, tet, trp-tet, Ipp, lac, Ipp-lac, laci q "' T7, T5, T3, 
gal, trc, ara, SP6, A-P R or A-P L promoter, which are advantageously used in Gram- 
negative bacteria. Further advantageous regulatory sequences are present for example 
in the Gram-positive promoters amy, dnaK, xylS and SP02, in the yeast or fungal 
35 promoters ADC1, MFcr, AC, P-60, UASH, MCB, PHO, CYC1, GAPDH, TEF, rp28, 
ADH. Promoters, which are particularly advantageous, are constitutive, tissue or 
compartment specific and inducible promoters. In general, "promoter" is understood as 
meaning, in the present context, a regulatory sequence in a nucleic acid molecule, 
which mediates the expression of a coding sequence segment of a nucleic acid 
40 molecule. In general, the promoter is located upstream to the coding sequence 
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segment. Some elements, for example expression-enhancing elements such as 
enhancer may. however, also be located downstream or even in the transcnbed reg.on. 

[0254 0 0 0] In principle, it is possible to use natural promoters together with their 
regulatory sequences, such as those mentioned above, for the novel process. It is also 
5 possible advantageously to use synthetic promoters, either additionally or alone, in 
particular when they mediate seed-specific expression such as described in, for 
example, WO 99/16890. 

[0255 0 0.0] The expression of the nucleic acid molecules used in the process may 
be desired alone or in combination with other genes or nucleic acids. Multiple nucleic 
10 acid molecules conferring the expression of advantageous genes can be introduced via 
the simultaneous transformation of several individual suitable nucleic acid constructs, 
i e expression constructs, or, preferably, by combining several expression cassettes 
on one construct It is also possible to transform several vectors with in each case 
several expression cassettes stepwise into the recipient organisms. 

15 [0256.0.0.0] As described above, the transcription of the genes introduced should 
advantageously be terminated by suitable terminators at the 3' end of the biosynthests 
genes introduced (behind the stop codon). A terminator, which may be used for this 
purpose is, for example, the OCS1 terminator, the nos3 terminator or the 35S 
terminator. As is the case with the promoters, different terminator sequences should be 

20 used for each gene. Terminators, which are useful in microorganism, are for example 
the flmA terminator, txn terminator or tap terminator. Such terminators can be rho- 
dependent or rho-independent. 

[0257 0.0.0] Different plant promoters such as. for example, the USP. the LegB4-, 
the DC3 promoter or the ubiquitin promoter from parsley or other herein mentioned 
25 promoter and different terminators may advantageously be used in the nucle.c acd 
construct. 

[0258 0 0 0] In order to ensure the stable integration, into the transgenic plant, of 
nucleic acid molecules used in the process according to the invention in combination 
with further biosynthesis genes over a plurality of generations, each of the coding 
30 regions used in the process should be expressed under the control of its own, 
preferably unique, promoter since repeating sequence motifs may lead to 
recombination events or to silencing or. in plants, to instability of the T-DNA. 

[0259 0 0 0] The nucleic acid construct is advantageously constructed in such a way 
that a promoter is followed by a suitable cleavage site for insertion of the nucleic acd to 
35 be expressed, advantageously in a polylinker, followed, if appropriate, by a terminator 
located behind the polylinker. If appropriate, this order is repeated several tomes so that 
several genes are combined in one construct and thus can be introduced into the 
transgenic plant in order to be expressed. The sequence is advantageously repeated 
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up to three times. For the expression, the nucleic acid sequences are inserted via the 
suitable cleavage site, for example in the polylinker behind the promoter. It is 
advantageous for each nucleic acid sequence to have its own promoter and, if 
appropriate, its own terminator, as mentioned above. However, it is also possible to 
5 insert several nucleic acid sequences behind a promoter and, if appropriate, before a 
terminator if a polycistronic transcription is possible in the host or target cells. In this 
context, the insertion site, or the sequence of the nucleic acid molecules inserted, in 
the nucleic acid construct is not decisive, that is to say a nucleic acid molecule can be 
inserted in the first or last position in the cassette without this having a substantial 
10 effect on the expression. However, it is also possible to use only one promoter type in 
the construct However, this may lead to undesired recombination events or silencing 
effects, as said. 

[0260.0.0.0] Accordingly, in a preferred embodiment, the nucleic acid construct 
according to the invention confers expression of the nucleic acid molecule of the 
15 invention, and, optionally further genes, in a plant and comprises one or more plant 
regulatory elements. Said nucleic acid construct according to the invention 
advantageously encompasses a plant promoter or a plant terminator or a plant 
promoter and a plant terminator. 

[0261.0.0.0] A "plant" promoter comprises regulatory elements, which mediate the 
20 expression of a coding sequence segment in plant cells. Accordingly, a plant promoter 
need not be of plant origin, but may originate from viruses or microorganisms, in 
particular for example from viruses which attack plant cells. 

[0262.0.0.0] The plant promoter can also originates from a plant cell, e.g. from the 
plant, which is transformed with the nucleic acid construct or vector as described 
25 herein. This also applies to other "plant" regulatory signals, for example in "plant" 
terminators. 

[0263.0.0.0] A nucleic acid construct suitable for plant expression preferably 
comprises regulatory elements which are capable of controlling the expression of 
genes in plant cells and which are operably linked so that each sequence can fulfill its 

30 function. Accordingly, the nucleic acid construct can also comprise transcription 
terminators. Examples for transcriptional termination are polyadenylation signals. 
Preferred polyadenylation signals are those which originate from Agrobacterium 
tumefaciens T-DNA, such as the gene 3 of the Ti plasmid pTiACHS, which is known as 
octopine synthase (Gielen et al., EMBO J. 3 (1984) 835 et seq.) or functional 

35 equivalents thereof, but all the other terminators which are functionally active in plants 
are also suitable. 

[0264.0.0.0] The nucleic acid construct suitable for plant expression preferably also 
comprises other operably linked regulatory elements such as translation enhancers, for 
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example the overdrive sequence, which comprises the tobacco mosaic virus 5'- 
untranslated leader sequence, which increases the protein/RNA ratio (Gallie et al., 
1987, Nucl. Acids Research 15:8693-8711). 

[0265.0.0.0] Other preferred sequences for use in operable linkage in gene 
5 expression constructs are targeting sequences, which are required for targeting the 
gene product into specific cell compartments (for a review, see Kermode, Crit. Rev. 
Plant Sci. 15, 4 (1996) 285-423 and references cited therein), for example into the 
vacuole, the nucleus, all types of plastids, such as amyloplasts, chloroplasts, 
chromoplasts, the extracellular space, the mitochondria, the endoplasmic reticulum, 
10 elaioplasts, peroxisomes, glycosomes, and other compartments of cells or 

extracellular. Sequences, which must be mentioned in this context are, in particular, the 
signal-peptide- or transit-peptide-encoding sequences which are known per se. For 
example, plastid-transit-peptide-encoding sequences enable the targeting of the 
expression product into the plastids of a plant cell. Targeting sequences are also 
1 5 known for eukaryotic and to a lower extent for prokaryotic organisms and can 
advantageously be operable linked with the nucleic acid molecule of the present 
invention to achieve an expression in one of said compartments or extracellular. 

[0266.0.0.0] For expression in plants, the nucleic acid molecule must, as described 
above, be linked operably to or comprise a suitable promoter which expresses the 
20 gene at the right point in time and in a cell- or tissue-specific manner. Usable 

promoters are constitutive promoters (Benfey et al., EMBO J. 8 (1989) 2195-2202), 
such as those which originate from plant viruses, such as 35S CAMV (Franck et al., 
Cell 21 (1980) 285-294), 19S CaMV (see also US 5352605 and WO 84/02913), 34S 
FMV (Sanger et al., Plant. Mol. Biol., 14, 1990: 433-443), the parsley ubiquitin 
25 promoter, or plant promoters such as the Rubisco small subunit promoter described in 
US 4,962,028 or the plant promoters PRP1 [Ward et al.. Plant. Mol. Biol. 22 (1993)]. 
SSU ' PGEL1 OCS [Leisner (1988) Proc Natl Acad Sci USA 85(5): 2553-2557], Hb4, 
usp, mas [Comal (1990) Plant Mol Biol 15 (3):373-381], STLS1, ScBV (Schenk (1999> 
Plant Mol Biol 39(6): 1221 -1230), B33, SAD1 or SAD2 (flax promoters, Jain et al., Crop 
30 Science. 39 (6), 1999: 1696-1701) or nos [Shaw et al. (1984) Nucleic Acids Res. 
12(20):7831-7846]. Stable, constitutive expression of the proteins according to the 
invention a plant can be advantageous. However, inducible expression of the 
polypeptide of the invention is advantageous, if a late expression before the harvest is 
of advantage, as metabolic manipulation may lead to a plant growth retardation. 

35 [0267.0.0.0] The expression of plant genes can also be facilitated as described 
above via a chemical inducible promoter (for a review, see Gatz 1997, Annu. Rev. 
Plant Physiol. Plant Mol. Biol., 48:89-108). Chemically inducible promoters are 
particularly suitable when it is desired to express the gene in a time-specific manner. 
Examples of such promoters are a salicylic acid inducible promoter -(WO 95/19443). 

40 and abscisic acid-inducible promoter (EP 335 528), a tetracyclin-inducible promoter 



WO 2005/014828 PCT7EP2004/008136 

122 

(Gatz et al. (1992) Plant J. 2, 397-404), a cyclohexanol- or ethanol-inducible promoter 
(WO 93/21 334) or others as described herein. 

[0268.0.0.0] Other suitable promoters are those which react to biotic or abiotic stress 
conditions, for example the pathogen-induced PRP1 gene promoter (Ward et al., Plant. 
5 Mol. Biol. 22 (1993) 361-366), the tomato heat-inducible hsp80 promoter (US 

5,187,267), the potato chill-inducible alpha-amylase promoter (WO 96/12814) or the 
wound-inducible pinll promoter (EP-A-0 375 091) or others as described herein. 

[0269.0.0.0] Preferred promoters are in particular those which bring about gene 
expression in tissues and organs in which the biosynthesis of fine chemical takes 

10 place, in seed cells, such as endosperm cells and cells of the developing embryo. 

Suitable promoters are the oilseed rape napin gene promoter (US 5,608,152), the Vicia 
faba USP promoter (Baeunilein et aL, Mol Gen Genet, 1991, 225 (3):459-67), the 
Arabidopsis oleosin promoter (WO 98/45461), the Phaseolus vulgaris phaseolin 
promoter (US 5,504,200), the Brassica Bce4 promoter (WO 91/13980), the bean arc5 

15 promoter, the carrot DcG3 promoter, or the Legumin B4 promoter (LeB4; Baeumlein et 
al., 1992, Plant Journal, 2 (2):233-9), and promoters which bring about the seed- 
specific expression in monocotyledonous plants such as maize, barley, wheat, rye, rice 
and the like. Advantageous seed-specific promoters are the sucrose binding protein 
promoter (WO 00/26388), the phaseolin promoter and the napin promoter. Suitable 

20 promoters which must be considered are the barley Ipt2 or Ipt1 gene promoter 

(WO 95/15389 and WO 95/23230), and the promoters described in WO 99/16890 
(promoters from the barley hordein gene, the rice glutelin gene, the rice oryzin gene, 
the rice prolamin gene, the wheat gliadin gene, the wheat glutelin gene, the maize zein 
gene, the oat glutelin gene, the sorghum kasirin gene and the rye secalin gene). 

25 Further suitable promoters are Amy32b, Amy 6-6 and Aleurain.[(JS 5,677,474], Bce4 
(oilseed rape) [US 5,530,149], glycinin (soya) [EP 571 741], phosphoenolpyruvate 
carboxylase (soya) [JP 06/62870], ADR12-2 (soya) [WO 98/08962], isocitrate lyase 
(oilseed rape) [US 5,689,040] or a-amylase (barley) [EP 781 849]. Other promoters 
which are available for the expression of genes in plants are leaf-specific promoters 

30 such as those described in DE-A 19644478 or light-regulated promoters such as, for 
example, the pea petE promoter. 

[0270.0.0.0] Further suitable plant promoters are the cytosolic FBPase promoter or 
the potato ST-LSI promoter (Stockhaus et al., EMBO J. 8, 1989, 2445), the Glycine 
max phosphoribosylpyrophosphate amidotransferase promoter (GenBank Accession 
35 No. U87999) or the node-specific promoter described in EP-A-0 249 676. 

[0271.0.0.0] Other promoters, which are particularly suitable, are those, which bring 
about plastid-specific expression. Suitable promoters such as the viral RNA 
polymerase promoter are described in WO 95/16783 and WO 97/06250, and the 
Arabidopsis dpP promoter, which is described in WO 99/46394. 
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[0272.0.0.0] Other promoters, which are used for the strong expression of 
heterologous sequences in as many tissues as possible, in particular also in leaves, 
are, in addition to several of the abovementioned viral and bacterial promoters, 
preferably, plant promoters of actin or ubiquitin genes such as, for example, the rice 
5 actinl promoter. Further examples of constitutive plant promoters are the sugarbeet V- 
>-rn <. ,. nA/r\ t\A m a K7o\ Cvamnips nf svnthetic constitutive Dromoters are 

Mirase piumuicio w n i-rwu./. w«4i. r .- - j • 

the Super promoter (WO 95/14098) and promoters derived from G-boxes (WO 
94/12015). If appropriate, chemical inducible promoters may furthermore also be used, 
compare EP-A 388186, EP-A 335528, WO 97/06268. 

1 0 [0273.0,0.0] As already mentioned herein, further regulatory sequences, which may 
be expedient, if appropriate, also include sequences, which target the transport and/or 
the localization of the expression products. Sequences, which must be mentioned in 
this context are, in particular, the signal-peptide- or transit-peptide-encoding sequences 
which are known per se. For example, plastid-transit-peptide-encoding sequences 

1 5 enable the targeting of the expression product into the plastids of a plant cell. 

[0274.0.0.0] Preferred recipient plants are, as described above, in particular those 
plants, which can be transformed in a suitable manner. These include 
monocotyledonous and dicotyledonous plants. Plants which must be mentioned in 
particular are agriculturally useful plants such as cereals and grasses, for example 

20 Triticum spp., Zea mays, Hordeum vulgare, oats, Secale cereale, Oryza sativa, 
Pennisetum glaucum, Sorghum bicolor, Triticale, Agrostis spp., Cenchrus ciliaris, 
Dactylis glomerata, Festuca arundinacea, Lolium spp., Medicago spp. and Saccharum 
spp., legumes and oil crops, for example Brassica juncea, Brassica napus, Glycine 
max! Arachis hypogaea, Gossypium hirsutum, Cicer arietinum, Helianthus annuus, 

25 Lens culinaris, Linum usitatissimum, Sinapis alba, Trifolium repens and Vicia 
narbonensis, vegetables and fruits, for example bananas, grapes, Lycopersicon 
esculentum, asparagus, cabbage, watermelons, kiwi fruit, Solanum tuberosum, Beta 
vulgaris, cassava and chicory, trees, for example Coffea species, Citrus spp., 
Eucalyptus spp., Picea spp., Pinus spp. and Populus spp., medicinal plants and trees, 

30 and flowers.. 

[0275.0.0.0] One embodiment of the present invention also relates to a method for 
generating a vector, which comprises the insertion, into a vector, of the nucleic acid 
molecule characterized herein, the nucleic acid molecule according to the invention or 
the expression cassette according to the invention. The vector can, for example, be 

35 introduced in to a cell, e.g. a microorganism or a plant cell, as described herein for the 
nucleic acid construct, or below under transformation or transfection or shown in the 
examples. A transient or stable transformation of the host or target cell is possible, 
however, a stable transformation is preferred. The vector according to the invention is 
preferably a vector, which is suitable for expressing the polypeptide according to the 

40 invention in a plant. The method can thus also encompass one or more steps for 
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integrating regulatory signals into the vector, in particular signals, which mediate the 
expression in microorganisms or plants. 

[0276.0.0.0] Accordingly, the present invention also relates to a vector comprising 
the nucleic acid molecule characterized herein as part of a nucleic acid construct 
5 suitable for plant expression or the nucleic acid molecule according to the invention. 

[0277.0.0.0] The advantageous vectors of the invention comprise the nucleic acid 
molecules which encode proteins according to the invention, nucleic acid molecules 
which are used in the process, or nucleic acid construct suitable for plant expression 
comprising the nucleic acid molecules used, either alone or in combination with further 

1 0 genes such as the biosynthesis or regulatory genes of the fine chemical metabolism 
e.g. with the genes mentioned herein above. In accordance with the invention, the term 
"vector" refers to a nucleic acid molecule, which is capable of transporting another 
nucleic acid to which it is linked. One type of vector is a "plasmid", which means a 
circular double-stranded DNA loop into which additional DNA segments can be ligated. 

15 A further type of vector is a viral vector, it being possible to ligate additional nucleic 
acids segments into the viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they have been introduced (for example bacterial 
vectors with bacterial replication origin). Other preferred vectors are advantageously 
completely or partly integrated into the genome of a host cell when they are introduced 

20 into the host cell and thus replicate together with the host genome. Moreover, certain 
vectors are capable of controlling the expression of genes with which they are in 
operable linkage. In the present context, these vectors are referred to as "expression 
vectors". As mentioned above, they are capable of autonomous replication or may be 
integrated partly or completely into the host genome. Expression vectors, which are 

25 suitable for DNA recombination techniques usually, take the form of plasmids. In the 
present description, "plasmid" and "vector" can be used interchangeably since the 
plasmid is the most frequently used form of a vector. However, the invention is also 
intended to encompass these other forms of expression vectors, such as viral vectors, 
which exert similar functions. The term vector is furthermore also to encompass other 

30 vectors which are known to the skilled worker, such as phages, viruses such as SV40, 
CMV, TMV, transposons, IS elements, phasmids, phagemids, cosmids, and linear or 
circular DNA. 

[0278.0.0.0] The recombinant expression vectors which are advantageously used in 
the process comprise the nucleic acid molecules according to the invention or the 
35 nucleic acid construct according to the invention in a form which is suitable for 

expressing, in a host cell, the nucleic acid molecules according to the invention or 
described herein. Accordingly, the the recombinant expression vectors comprise one or 
more regulatory signals selected on the basis of the host cells to be used for the 
expression, in operable linkage with the nucleic acid sequence to be expressed. 
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[0279.0.0.0] In a recombinant expression vector, "operable linkage" means that the 
nucleic acid molecule of interest is linked to the regulatory signals in such a way that 
expression of the nucleic acid molecule is possible: they are linked to one another in 
such a way that the two sequences fulfill the predicted function assigned to the 
5 sequence (for example in an in-vitro transcription/translation system, or in a host cell if 
the vector is introduced into the host cell). 

[0280.0.0.0] The term "regulatory sequence" is intended to comprise promoters, 
enhancers and other expression control elements (for example polyadenylation 
signalsThese regulatory sequences are described, for example, in Goeddel: Gene 
10 Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
CA (1 990), or see: Gruber and Crosby, in: Methods in Plant Molecular Biology and 
Biotechnolgy, CRC Press, Boca Raton, Florida, Ed.: Glickand Thompson, chapter 7, 
89-108, including the references cited therein. Regulatory sequences encompass 
those, which control the constitutive expression of a nucleotide sequence in many 
1 5 types of host cells and those which control the direct expression of the nucleotide 

sequence in specific host cells only, and under specific conditions. The skilled worker 
knows that the design of the expression vector may depend on factors such as the 
selection of the host cell to be transformed, the extent to which the desired protein is 
expressed, and the like. A preferred selection of regulatory sequences is described 
20 above, for example promoters, terminators, enhancers and the like. The term 

regulatory sequence is to be considered as being encompassed by the term regulatory 
signal. Several advantageous regulatory sequences, in particular promoters and 
terminators are described above. In general, the regulatory sequences described as 
advantageous for nucleic acid construct suitable for expression are also applicable for 
25 vectors. 

[0281 .0.0.0] The recombinant expression vectors used can be designed specifically 
for the expression, in prokaryotic and/or eukaryotic cells, of nucleic acid molecules 
used in the process. This is advantageous since intermediate steps of the vector 
construction are frequently carried out in microorganisms for the sake of simplicity. For 
30 example, the genes according to the invention and other genes can be expressed in 
bacterial cells, insect cells (using baculovirus expression vectors), yeast cells and other 
fungal cells [Romanos (1992), Yeast 8:423-488; van den Hondel, (1991), in: More 
Gene Manipulations in Fungi, J.W. Bennet & L.L. Lasure. Ed., pp. 396-428: Academic 
Press- San Diego; and van den Hondel, C.A.M.J.J. (1991), in: Applied Molecular 
35 Genetics of Fungi, Peberdy, J.F., et al., Ed., pp. 1-28, Cambridge University Press: 
Cambridge], algae [Falciatore et al., 1999, Marine Biotechnology. 1, 3:239-251] using 
vectors and following a transformation method as described in WO 98/01572, and 
preferably in cells of multi-celled plants [see Schmidt, R. and Willmitzer, L. (1988) Plant 
Cell Rep.:583-586; Plant Molecular Biology and Biotechnology, C Press, Boca Raton, 
40 Florida, chapter 6/7, pp.71-1 19 (1993); F.F. White, in: Transgenic Plants, Bd. 1, 

Engineering and Utilization, Ed.: Kung and R. Wu, Academic Press (1993), 128-43; 
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Potrykus. Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991), 205-225 (and 
references cited therein)]. Suitable host cells are furthermore discussed in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1990). As an alternative, the sequence of the recombinant expression 
5 vector can be transcribed and translated in vitro, for example using T7 promotor- 
regulatory sequences and T7 polymerase. 

[0282.0.0.0] Proteins can be expressed in prokaryotes using vectors comprising 
constitutive or inducible promoters, which control the expression of fusion proteins or 
nonfusion proteins. Typical fusion expression vectors are, inter alia, pGEX (Pharmacia 

10 Biotech Inc; Smith, D.B., and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New 
England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ), in which 
glutathione-S-transferase (GST), maltose-E-binding protein or protein A is fused with 
the recombinant target protein. Examples of suitable inducible nonfusion E. coli 
expression vectors are, inter alia, pTrc (Amann et al. (1988) Gene 69:301-315) and 

15 pET 1 1d [Studier et al., Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89]. The target gene expression of 
the pTrc vector is based on the transcription of a hybrid trp-lac fusion promoter by the 
host RNA polymerase. The target gene expression from the pET 1 1d vector is based 
on the transcription of a T7-gn10-lac fusion promoter, which is mediated by a 

20 coexpressed viral RNA polymerase (T7 gn1). This viral polymerase is provided by the 
host strains BL21 (DE3) or HMS174 (DE3) by a resident A-prophage, which harbors a 
T7 gn1 gene under the transcriptional control of the lacUV 5 promoter. 

[0283.0.0.0] Other vectors which are suitable in prokaryotic organisms are known to 
the skilled worker; these vectors are for example in E. coli pLG338, pACYC184, the 
25 pBR series, such as pBR322, the pUC series such as pUC18 or pUC19, the M1 13mp 
series, pKC30, pRep4, pHS1, pHS2, pPLc236, pMBL24, pLG200, pUR290, pIN-lll 113 - 
B1, Agt11 or pBdCI, in Streptomyces plJ101, plJ364, plJ702 or plJ361, in Bacillus 
pUB110, pC194 or pBD214, in Corynebacterium pSA77 or pAJ667. 

[0284.0.0.0] In a further embodiment, the expression vector is a yeast expression 
30 vector. Examples of vectors for expression in the yeasts S. cerevisiae encompass 
pYeDesaturased (Baldari et al. (1987) Embo J. 6:229-234), pMFa (Kurjan and 
Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz et al. (1987) Gene 54:113-123) 
and pYES2 (Invitrogen Corporation, San Diego, CA). Vectors and methods for the 
construction of vectors which are suitable for use in other fungi, such as the 
35 filamentous fungi, encompass those which are described in detail in: van den Hondel, 
C.A.MJJ. [(1991), J.F. Peberdy, Ed., pp. 1-28, Cambridge University Press: 
Cambridge; or in: More Gene Manipulations in Fungi; J.W. Bennet & L.L Lasure, Ed., 
pp. 396-428: Academic Press: San Diego]. Examples of other suitable yeast vectors 
are 2DM, pAG-1 , YEp6, YEp13 or pEMBLYe23. 
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[0285.0.0.0] Further vectors, which may be mentioned by way of example, are 
pALS1, plL2 or pBB1 16 in fungi or pLGV23, pGHIac + . pBIN19, pAK2004 or pDH51 in 
plants. 

[0286.0.0.0] As an alternative, the nucleic acid sequences can be expressed in 
5 insect cells using baculovirus expression vectors. Baculovirus vectors which are 
available for expressing proteins in cultured insect cells (for example Sf9 cells) 
encompass the pAc series (Smith et al. (1983) Mol. Cell Biol. 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

[0287.0.0.0] The abovementioned vectors are only a small overview of potentially 
10 " suitable vectors. Further plasmids are known to the skilled worker and are described, 
for example, in: Cloning Vectors (Ed. Pouwels, P.H., et al.. Elsevier, Amsterdam- 
New York-Oxford, 1985, ISBN 0 444 904018). Further suitable expression systems for 
prokaryotic and eukaryotic cells, see the chapters 16 and 17 by Sambrook, J., Fritsch, 
E.F.. and Maniatis, T., Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold 
15 Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, 1989. 

[0288.0.0.0] Accordingly, one embodiment of the invention relates to a vector where 
the nucleic acid molecule according to the invention is linked operably to regulatory 
sequences which permit the expression in a prokaryotic or eukaryotic or in a 
20 prokaryotic and eukaryotic host 

[0289.0.0.0] Accordingly, one embodiment of the invention relates to a host cell, 
which has been transformed stably or transiently with the vector according to the 
invention or the nucleic acid molecule according to the invention or the nucleic acid 
construct according to the invention. 

25 [0290.0.0.0] Depending on the host organism, the organisms used in the process 
according to the invention are cultured or grown in a manner with which the skilled 
worker is familiar. As a rule, microorganisms are grown in a liquid medium comprising a 
carbon source, usually in the form of sugars, a nitrogen source, usually in the form of 
organic nitrogen sources such as yeast extract or salts such as ammonium sulfate, 

30 trace elements such as iron salts, manganese salts, magnesium salts, and, if 

appropriate, vitamins, at temperatures between 0°C and 100°C, preferably between 
10°C and 60°C, while passing in oxygen. In the event the microorganism is anaerobe, 
no oxygen is blown through the culture medium. The pH value of the liquid nutrient 
medium may be kept constant, that is to say regulated during the culturing phase, or 

35 not. The organisms may be cultured batchwise, semibatchwise or continuously. 
Nutrients may be provided at the beginning of the fermentation or fed in 
semicontinuously or continuously. 
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[0291.0.0.0] The fine chemical produced can be isolated from the organism by 
methods with which the skilled worker is familiar. For example via extraction, salt 
precipitation and/or ion-exchange chromatography etc. To this end, the organisms may 
be disrupted beforehand. The process according to the invention can be conducted 
5 batchwise, semibatchwise or continuously. A summary of known culture and isolation 
techniques can be found in the textbook by Chmiel [BioprozefJtechnik 1, Einfuhrung in 
die Bioverfahrenstechnik (Gustav Fischer Verlag, Stuttgart, 1991)], Demain et al. 
(Industrial Microbiology and Biotechnology, second edition, ASM Press, Washington, 
D.C., 1999, ISBN 1-55581-128-0] or in the textbook by Storhas [Bioreaktoren und 
10 periphere Einrichtungen (Vieweg Verlag, Braunschweig/Wiesbaden, 1994)]. 

[0292.0.0.0] In one embodiment, the present invention relates to a polypeptide 
encoded by the nucleic acid molecule according to the present invention, preferably 
conferring an increase in the fine chemical content in an organism or cell after 
increasing the expression or activity. 

1 5 [0293.0.0.0] The present invention also relates to a process for the production of a 
polypeptide according to the present invention, the polypeptide being expressed in a 
host cell according to the invention, preferably in a microorganism or a transgenic plant 
cell. 

[0294.0.0.0] In one embodiment, the nucleic acid molecule used in the process for 
20 the production of the polypeptide is derived from a microorganism, preferably from a 
prokaryotic or protozoic cell with a eukaryotic organism as host cell. E.g., in one 
embodiment the polypeptide is produced in a plant cell or plant with a nucleic acid 
molecule derived from a prokaryote or a fungus or an alga or another microorganismus 
but not from plant. 

25 [0295.0.0.0] The skilled worker knows that protein and DNA expressed in different 
organisms differ in many respects and properties, e.g. DNA modulation and imprinting, 
such as methylation or post-translational modification, as for example glucosylation, 
phosphorylation, acetylation, myristoylation, ADP-ribosylation, farnesylation, 
carboxylation, sulfation, ubiquination, etc. though having the same coding sequence. 

30 Preferably, the cellular expression control of the corresponding protein differs 

accordingly in the control mechanisms controlling the activity and expression of an 
endogenous protein or another eukaryotic protein. One major difference between 
proteins expressed in prokaryotic or eukaryotic organisms is the amount and pattern of 
glycosylation. For example in E. coli there are no glycosylated proteins. Proteins 

35 expressed in yeasts have high mannose content in the glycosylated proteins, whereas 
in plants the glycosylation pattern is complex. 



[0296.0.0.0] The polypeptide of the present invention is preferably produced by 
recombinant DNA techniques. For example, a nucleic acid molecule encoding the 



WO 2005/014828 PCT/EP2004/008136 

129 

protein is cloned into a vector (as described above), the vector is introduced into a host 
cell (as described above) and said polypeptide is expressed in the host cell. Said 
polypeptide can then be isolated from the cells by an appropriate purification scheme 
using standard protein purification techniques. Alternative to recombinant expression, 
5 the polypeptide or peptide of the present invention can be synthesized chemically using 
standard peptide synthesis techniques. 

[0297.0.0.0] Moreover, native polypeptides conferring the increase of the fine 
chemical in an organism or part thereof can be isolated from cells (e.g., endothelial 
cells), for example using the antibody of the present invention as described below, in 

1 0 particular, an anti-YNL090W protein antibody or an antibody against polypeptides as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10. 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82. 84, 86, 88, 90, 
92, 94. 96, 98, 100, 102, 104, 106. 108. 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128. 130, 132, 134. 136, 138, 140, 142, 144, 146, 148, 150. 152, 154, 156, 158, 160, 

15 162. 164. 166, 168. 170. 172. 174. 176. 178. 180. 182, 184, 186, 188, 190. 192, 194, 
196, 198, 200, 202, 204. 206, 208. 210, 212, 214, 216, 218, 220, 222, 224. 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248. 250, 252. 254, 256, 258, 260, 262, 
264. 266. 268, 270, 272, 274, 276, 278, 280, 282. 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318. 320. 322. 324. 326. 328. 330. 

20 332. 334, 336. 338, 340. 342. 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366, 368, 370, 372, 374, 376, 378, 380. 382. 384. 386, 388. 390, 392 or 394, which 
can be produced by standard techniques utilizing the polypeptid of the present 
invention or fragment thereof, i.e., the polypeptide of this invention. Preferred are 
monoclonal antibodies. 

25 [0298.0.0.0] In one embodiment, the present invention relates to a polypeptide 

having the amino acid sequence encoded by a nucleic acid molecule of the invention or 
obtainable by a process of the invention. Said polypeptide confers preferably the 
aforementioned activity, in particular, the polypeptide confers the increase of the fine 
chemical in a cell or an organsim or a part thereof after increasing the cellular activity, 

30 e.g. by increasing the expression or the specific activity of the polypeptide. 

[0299.0.0.0] In one embodiment, the present invention relates to a polypeptide 
having the sequence as depicted in SEQ ID NO: 2, 4, 6, 8, 10. 12. 14. 16. 18. 20, 22, 
24, 26, 28, 30, 32, 34, 36, 38, 40. 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90. 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 

35 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 162, 164. 166, 168. 170. 172, 174, 176, 178. 180, 182, 184, 
186, 188, 190, 192. 194, 196, 198, 200, 202. 204, 206, 208. 210, 212, 214. 216, 218, 
220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246. 248. 250, 252, 
254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274. 276, 278, 280, 282, 284, 286, 

40 288, 290, 292, 294. 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 
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322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 
356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 
390, 392 or 394 or as coded by the nucleic acid molecule as depicted in SEQ ID NO: 1, 
3, 5/7. 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 
5 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 
171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 
205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 
10 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, .303, 305, 
307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 
341 , 343, 345, 347, 349, 351 , 353, 355, 357, 359, 361 , 363, 365, 367, 369, 371 , 373, 
375, 377, 379, 381, 383, 385, 387, 389, 391or 393or functional homologues thereof. 

1 5 [0300.0.0.0] In one advantageous embodiment, in the method of the present 
invention the activity of a polypeptide is increased comprising or consisting of the 
consensus sequence as depicted in SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, 
. SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID NO: 398, 
SEQ ID NO: 399 and/or SEQ ID NO: 400 and in one another embodiment, the present 

20 invention relates to a polypeptide comprising or consisting of the consensus sequence 
as depicted in SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ 
ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 399 
and/or SEQ ID NO: 400 whereby 20 or less, preferably 15 or 10, preferably 9, 8, 7, or 
6, more preferred 5 or 4, even more preferred 3, even more preferred 2, even more 

25 preferred 1 , most preferred 0 of the amino acids positions indicated can be replaced by 
any amino add. 

[0301 .0.0.0] In one embodiment not more than 1 5%, preferably 1 0%, even more 
preferred 5%, 4%, 3%, or 2%, most preferred 1% or 0% of the amino acid position 
indicated by a letter are/is replaced another amino acid. 

30 [0302.0.0.0] In one embodiment 20 or less, preferably 15 or 10, preferably 9, 8, 7, or 
6, more preferred 5 or 4, even more preferred 3, even more preferred 2, even more 
preferred 1, most preferred 0 amino acids are inserted into the consensus sequence. 

[0303.0.0.0] The consensus sequences of specified domains were derived from a 
multiple alignment of all sequences. The consensus sequences are disclosed under 
35 SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, 
SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 399 and/or SEQ ID 
NO: 400. The letters represent the three letter amino acid code and indicate that the 
amino acids are conserved in all aligned proteins. The letter Xaa stands for amino 
acids, which are not conserved in all sequences. In some cases of the sequences as 
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depicted in SEQ ID NO: 47, SEQ ID NO:. 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID 
NO: 51, SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 399 and/or 
SEQ ID NO: 400 preferred amino acids are mentioned in the sequence protocol in 
others all natural amino acids are possible. Figur 1 shows an alignment with part of the 
5 sequences in the one letter amino acid code. The conserved sequences are framed. 

[0304.0.0.0] The alignment was performed either with the Software AlignX (sept 
25, 2002) a component of Vector NTI Suite 8.0 , lnforMax™,lnvitrogen™ life science 
software, U.S. Main Office, 7305 Executive Way.Frederick, MD 21704.USA with the 

10 following settings: For pairwise alignments: gap opening penality: 10,0; gap extension 
penality 0,1. For multiple alignments: Gap opening penalty: 10,0; Gap extension 
penalty: 0,1; Gap separation penalty range: 8; Residue substitution matrix: blosum62; 
Hydrophilic residues: GPSNDQEKR; Transition weighting: 0,5; Consensus 
calculation options: Residue fraction for consensus: 1 or preferably the percent 

1 5 sequence identity between two nucleic acid or polypeptide sequences was determined 
using the Vector NTI 6.0 (PC) software package (InforMax, 7600 Wisconsin Ave., . 
Bethesda, MD 20814). A gap opening penalty of 15 and a gap extension penalty of 
6.66 are used for determining the percent identity of two nucleic acids. A gap opening 
penalty of 10 and a gap extension penalty of 0.1 are used for determining the percent 

20 identity of two polypeptides. All other parameters are set at the default settings. For 
purposes of a multiple alignment (Clustal W algorithm), the gap opening penalty is 10, 
and the gap extension penalty is 0.05 with blosum62 matrix. 

[0305.0.0.0] In one advantageous embodiment, the method of the present invention 
comprises the increasing of a polypeptide comprising or consisting of plant or 

25 microorganism specific consensus sequences, 

In one embodiment, said polypeptide of the invention distinguishes over the sequence 
as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20. 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 881 
90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 

30 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148. 150, 152, 154, 156, 158, 
160, 162. 164. 166. 168. 170, 172, 174. 176, 178, 180, 182, 184, 186, 188, 190, 192, 
194, 196, 198, 200, 202, 204, 206. 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 
228, 230. 232, 234, 236, 238, 240, 242. 244, 246, 248, 250, 252, 254, 256, 258, 260, 
262, 264. 266, 268, 270. 272, 274. 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 

35 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 
330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 
364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394 by 
one or more amino acids. In one embodiment, polypeptide distinguishes form the 
sequence as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 

40 30, 32. 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 
84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 
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122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184. 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 
224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
5 258, 260, 262, 264, 266, 268. 270, 272, 274. 276, 278, 280, 282, 284, 286, 288, 290. 
292. 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 
326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356. 358, 
360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 
394 by more than 5, 6, 7, 8 or 9 amino acids, preferably by more than 10, 15, 20, 25 or 

10 30 amino acids, evenmore preferred are more than 40, 50, or 60 amino acids and, 
preferably, the sequence of the polypeptide of the invention distinguishes from the. 
sequence as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78. 80. 82, 
84, 86, 88, 90. 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 

15 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172. 174, 176, 178, 180, 182, 184, 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 
224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256. 
258. 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 

20 292, 294, 296, 298, 300, 302, 304, 306. 308. 310, 312, 314, 316, 318, 320, 322, 324, 
326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346/348, 350, 352, 354, 356, 358, 
360, 362, 364, 366, 368, 370. 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 
394 by not more than 80% or 70% of the amino acids, preferably not more than 60% or 
50%, more preferred not more than 40% or 30%, even more preferred not more than 

25 20% or 10%. In an other embodiment, said polypeptide of the invention does not 
consist of the sequence as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70. 72, 74, 
76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 
116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140. 142. 144, 146, 148, 

30 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 
184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 
218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 
252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 
286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 

35 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 
354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 
388, 390, 392 or 394. 

[0306.0.0.0] In one embodiment, the polypeptide of the invention comprises any one 
of the sequences not known to the public before. In one embodiment, the polypeptide 
40 of the invention orginates from a non-plant cell, in particular from a microorganism, and 
was expressed in a plant cell. In one embodiment, the present invention relates to a 
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polypeptide encoded by the nucleic acid molecule of the invention or used in the 
process of the invention for which an activity has not been described yet. 

[0307.0.0.0] In one embodiment, the invention relates to polypeptide conferring an 
increase in the fine chemical in an organism or part being encoded by the nucleic acid 

5 molecule of the invention or used in the process of the invention and having a 

sequence which distinguishes from the sequence as depicted in SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 
64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100. 102, 104, 106, 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 

10 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 
176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 
210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 

15 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 
346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 
380, 382, 384, 386, 388, 390, 392 or 394 by one or more amino acids. In an other 
embodiment, said polypeptide of the invention does not consist of the sequence as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 

20 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 

25 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258. 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 
366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 394. In a 

30 further embodiment, said polypeptide of the present invention is less than 100%, 

99,999%, 99,99%, 99,9% or 99% identical. In one embodiment said polypeptide does 
not consist of the sequence encoded by the nucleic acid molecules as depicted in SEQ 
ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31. 33, 35, 37, 39, 41 43, 45, 
55, 57. 59. 61. 63. 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 

35 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 
135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 
169, 171, 173, 175, 177, 179, 181. 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 
203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231. 233. 235, 
237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 

40 271 , 273, 275, 277, 279, 281 , 283, 285, 287, 289, 291 , 293, 295, 297, 299, 301 , 303, 
305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327. 329, 331, 333, 335, 337, 
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339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 
373, 375, 377, 379, 381, 383, 385, 387, 389, 391or 393. 

[0308.0.0.0] In one embodiment, the present invention relates to a polypeptide 
having the biological activity represented by a protein as depicted in SEQ ID NO: 2, 4, 
5 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 
62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 

10 . 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 
276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 
310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 
344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 

15 378, 380, 382, 384, 386, 388, 390, 392 or 394and which distinguishes over the 

aforementioned sequences by one or more amino acids, preferably by more than 5, 6, 
7, 8 or 9 amino acids, preferably by more than 10, 15, 20, 25 or 30 amino acids, 
evenmore preferred are more than 40, 50, or 60 amino acids but even more preferred 
by less than 70% of the amino acids, more preferred by less than 50%, even more 

20 preferred my less than 30% or 25%, more preferred are 20% or 15%, even more 
preferred are less than 10%. 

[0309.0.0.0] The terms "protein" and "polypeptide" used in this application are 
interchangeable. "Polypeptide" refers to a polymer of amino acids (amino acid 
sequence) and does not refer to a specific length of the molecule. Thus peptides and 

25 oligopeptides are included within the definition of polypeptide. This term does also refer 
to or include post-translational modifications of the polypeptide, for example, 
glycosylates, acetylations, phosphorylations and the like. Included within the 
definition are, for example, polypeptides containing one or more analogs of an amino 
acid (including, for example, unnatural amino acids, etc.), polypeptides with substituted 

30 linkages, as well as other modifications known in the art, both naturally occurring and 
non-naturally occurring. 

[0310.0.0.0] Preferably, the polypeptide is isolated. An "isolated" or "purified" protein 
or nucleic acid molecule or biologically active portion thereof is substantially free of 
cellular material when produced by recombinant DNA techniques or chemical 
35 precursors or other chemicals when chemically synthesized. 

[031 1 .0.0.0] The language "substantially free of cellular material" includes 
preparations of the polypeptide of the invention in which the protein is separated from 
cellular components of the cells in which it is naturally or recombinant^ produced. In 
one embodiment, the language "substantially free of cellular material" includes 
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preparations having less than about 30% (by dry weight) of "contaminating protein", 
more preferably less than about 20% of "contaminating protein", still more preferably 
less than about 10% of "contaminating protein", and most preferably less than about 
5% "contaminating protein". The term "Contaminating protein" relates to polypeptides, 

5 which are not polypeptides of the present invention. When the polypeptide of the 
present invention or biologically active portion thereof is recombinantly produced, it is 
also preferably substantially free of culture medium, i.e., culture medium represents 
less than about 20%, more preferably less than about 10%, and most preferably less 
than about 5% of the volume of the protein preparation. The language "substantially 

1 0 free of chemical precursors or other chemicals" includes preparations in which the 
polypeptide of the present invention is separated from chemical precursors or other 
chemicals, which are involved in the synthesis of the protein. The language 
"substantially free of chemical precursors or other chemicals" includes preparations 
having less than about 30% (by dry weight) of Chemical precursors or non-polypeptide 

15 of the inventionchemicals, more preferably less than about 20% chemical precursors or 
non-polypeptide of the inventionchemicals, still more preferably less than about 10% 
chemical precursors or non-polypeptide of the invention-chemicals, and most 
preferably less than about 5% chemical precursors or non- polypeptide of the 
invention-chemicals. In preferred embodiments, isolated proteins or biologically active 

20 portions thereof lack contaminating proteins from the same organism from which the 
polypeptide of the present invention is derived. Typically, such proteins are produced 
by recombinant techniques. 

[031 2.0.0.0] A polypeptide of the invention can participate in the process of the 
present invention. The polypeptide or a portion thereof comprises preferably an amino 

25 acid sequence which is sufficiently homologous to an amino acid sequence as depicted 
in SEQ ID NO: 2, 4, 6. 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 
42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90. 92, 94, 
96. 98, 100, 102, 104. 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132. 134. 136, 138, 140, 142. 144, 146, 148. 150. 152, 154, 156, 158, 160, 162. 164, 

30 166*. 168, 170, 172, 174. 176. 178. 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 
200, 202, 204, 206, 208. 210. 212, 214, 216. 218. 220. 222. 224. 226. 228, 230. 232. 
234. 236, 238, 240, 242. 244, 246, 248, 250, 252, 254, 256, 258. 260, 262, 264, 266, 
268, 270, 272, 274, 276. 278. 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 
302. 304, 306, 308, 310, 312, 314, 316, 318, 320, 322. 324, 326, 328, 330, 332, 334, 

35 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360. 362. 364, 366, 368, 
370, 372, 374, 376, 378, 380. 382. 384. 386. 388, 390, 392 or 394 such that the protein 
or portion thereof maintains the ability to confer the activity of the present invention. 
The portion of the protein is preferably a biologically active portion as described herein. 
Preferably, the polypeptide used in the processof the invention has an amino acid 

40 sequence identical as shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
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80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 

120, 122, 124, 126, 128. 130, 132, 134, 136, 138. 140, 142, 144, 146, 148, 150, 152. 

154. 156, 158. 160, 162. 164, 166, 168, 170, 172. 174, 176, 178. 180. 182. 184. 186, 

188. 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 
5 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254. 

256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288. 
290. 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 
324, 326, 328. 330, 332, 334, 336, 338, 340, 342, 344, 346, 348. 350, 352, 354, 356, 
358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386. 388, 390, 

10 392 or 394. 

[0313.0.0.0] Further, the polypeptide can have an amino acid sequence which is 
encoded by a nucleotide sequence which hybridizes, preferably hybridizes under 
stringent conditions as described above, to a nucleotide sequence of the nucleic acid 
molecule of the present invention. Accordingly, the polypeptide has an amino acid 

15 sequence which is encoded by a nucleotide sequence that is at least about 35%. 40%, 
45%, 50%, 55%, 60%, 65% or 70%, preferably at least about 75%, 80%, 85% or 90, 
and more preferably at least about 91%, 92%, 93%, 94% or 95%, and even more 
preferably at least about 96%, 97%, 98%, 99% or more homologous to one of the 
amino acid sequences of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 

20 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66. 68. 70, 72, 74. 76. 78, 80, 82. 
84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108. 110. 112. 114, 116, 118, 120, 
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188. 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 

25 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 
292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320. 322, 324, 
326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 
360, 362, 364, 366, 368, 370, 372, 374, 376, 378. 380. 382, 384, 386, 388, 390, 392 or 

30 394. The preferred polypeptide of the present invention preferably possesses at least 
one of the activities according to the invention and described herein. A preferred 
polypeptide of the present invention includes an amino acid sequence encoded by a 
nucleotide sequence which hybridizes, preferably hybridizes under stringent conditions, 
to a nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 

35 29, 31. 33. 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73. 75, 77, 79, 81, 
83, 85, 87, 89, 91. 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 

121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 

155. 157. 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 

189, 191, 193, 195, 197, 199, 201. 203. 205, 207, 209, 211, 213, 215, 217, 219, 221, 
40 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 

257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
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291, 293, 295, 297, 299. 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 
325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381. 383, 385, 387. 389. 391 or 
393 or which is homologous thereto, as defined above. 

5 [0314.0.0.0] Accordingly the polypeptide of the present invention can vary from SEQ 
ID NO: 2, 4. 6, 8. 10, 12. 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42. 44, 
46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 
100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122. 124. 126, 128, 130, 132, 
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 
10 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 
202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224. 226. 228, 230, 232, 234, 
236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 
270, 272, 274, 276, 278, 280, 282, 284. 286, 288, 290, 292, 294, 296, 298, 300, 302, 
304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326. 328, 330, 332, 334, 336, 
15 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368. 370, 
372, 374. 376, 378, 380, 382, 384, 386. 388, 390, 392 or 394 in amino acid sequence 
due to natural variation or mutagenesis, as described in detail herein. Accordingly, the 
polypeptide comprise an amino acid sequence which is at least about 35%, 40%, 45%, 
50%, 55%, 60%, 65% or 70%, preferably at least about 75%, 80%, 85% or 90, and 
20 more preferably at least about 91 %, 92%, 93%, 94% or 95%, and most preferably at 
least about 96%, 97%, 98%, 99% or more homologous to an entire amino acid 
sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86. 88, 
90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112. 114. 116. 118. 120, 122, 124, 
25 126, 128, 130, 132, 134, 136. 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 
160, 162, 164, 166, 168. 170. 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 
194, 196. 198, 200, 202, 204, 206, 208, 210. 212, 214, 216. 218, 220, 222. 224. 226, 
228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 
262*, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 
30 296, 298, 300. 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 
330, 332, 334, 336, 338, 340, 342, 344, 346. 348, 350, 352, 354, 356, 358, 360, 362, 
364, 366. 368, 370, 372, 374, 376, 378, 380. 382, 384, 386, 388, 390, 392 or 394. 

[031 5.0.0.0] For the comparison of amino acid sequences the same algorithms as 
described above or nucleic acid sequences can be used. Results of high quality are 

35 reached by using the algorithm of Needleman and Wunsch or Smith and Waterman. 
Therefore programs based on said algorithms are preferred. Advantageously the 
comparisons of sequences can be done with the program PileUp (J. Mol. Evolution., 
25, 351-360, 1987, Higgins et al., CABIOS, 5 1989: 151-153) or preferably with the 
programs Gap and BestFit, which are respectively based on the algorithms of 

40 Needleman and Wunsch [J. Mol. Biol. 48; 443-453 (1970)] and Smith and Waterman 
[Adv. Appl. Math. 2; 482-489 (1981)]. Both programs are part of the GCG software- 
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package [Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, USA 
5371 1 (1991); Altschu! et al. (1997) Nucleic Acids Res. 25:3389 et seq.]. Therefore 
preferably the calculations to determine the perentages of sequence homology are 
done with the program Gap over the whole range of the sequences. The following 
5 standard adjustments for the comparison of amino acid sequences were used: gap 
weight: 8, length weight: 2, average match: 2.912, average mismatch: -2.003. 

[031 6.0.0.0] Biologically active portions of an polypeptide of the present invention 
include peptides comprising amino acid sequences derived from the amino acid 
sequence of the polypeptide of the present invention or used in the process of the 

10 present invention, e.g., the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, .10, 
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 

15 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 
210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 
278, 280, 282, 284, 286, 288, 290, 292^294, 296, 298, 300, 302, 304, 306, 308, 310, 
312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 

20 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 
380, 382, 384, 386, 388, 390, 392 or 394 or the amino acid sequence of a protein 
homologous thereto, which include fewer amino acids than a full length polypeptide of 
the present invention or used in the process of the present invention or the full length 
protein which is homologous to an polypeptide of the present invention or used in the 

25 process of the present invention depicted herein, and exhibit at least one activity of 
polypeptide of the present invention or used in the process of the present invention . 

[0317.0.0.0] Typically, biologically (or immunologically) active portions i.e. peptides, 
e.g., peptides which are, for example, 5, 10, 15, 20, 30, 35, 36, 37, 38; 39, 40, 50, 100 
or more amino acids in length comprise a domain or motif with at least one activity or 
30 epitope of a polypeptide of the present invention or used in the process of the present 
invention. Moreover, other biologically active portions, in which other regions of the 
polypeptide are deleted, can be prepared by recombinant techniques and evaluated for 
one or more of the activities described herein. 

[0318.0.0.0] Manipulation of the nucleic acid molecule of the invention may result in 
35 the production of proteins having the biological activity represented by a protein as 

depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
40 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
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196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218. 220, 222, 224, 226, 228, 
230 232 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264*, 266*. 268', 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298*. 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
5 332*. 334, 336, 338, 340, 342, 344, 346, 348, 350. 352, 354, 356, 358, 360, 362, 364, 

— . . — ,r* mn *an ooo iqa "3QK *3»8 3Qfl nr 394. and 

36b, 368, 3/U, 3/^, ■31'+, O/d, 010, oou, Ju^., ^w-t, » • 

having differences from said aforementioned wild-type proteins. These proteins may be 
improved in efficiency or activity, may be present in greater numbers in the cell than is 
usual, or may be decreased in efficiency or activity in relation to the wild type protein. 

10 [031 9.0.0.0] Any mutagenesis strategies for the polypeptide of the present invention 
or the polypeptide used in the process of the present invention to result in increasing 
said activity are not meant to be limiting; variations on these strategies will be readily 
apparent to one skilled in the art. Using such strategies, and incorporating the 
mechanisms disclosed herein, the nucleic acid molecule and polypeptide of the 

1 5 invention may be utilized to generate plants or parts thereof, expressing wildtype 
proteins of the invention or mutated protein encoding nucleic acid molecules and 
polypeptide molecules of the invention such that the yield, production, and/or efficiency 
of production of a desired compound is improved. 

[0320.0.0.0] This desired compound may be any natural product of plants, which 
20 includes the final products of biosynthesis pathways and intermediates of naturally- 
occurring metabolic pathways, as well as molecules which do not naturally occur in the 
metabolism of said cells, but which are produced by a said cells of the invention. 
Preferrably, the compound is a composition of amino acids or a recovered amino acid, 
in particular, the fine chemical, free or in protein-bound form. 

25 [0321.0.0.0] The invention also provides chimeric or fusion proteins. 

[0322.0.0.0] As used herein, an "chimeric protein" or "fusion protein" comprises an 
polypeptide operatively linked to a polypeptide which does not confer above-mentioned 
activity, in particulare, which does not confer an increase of content of the fine chemical 
in a cell or an organism or a part thereof, if its activity is increased. 

30 [0323.0.0.0] In one embodiment a protein (= polypeptide)" having the biological 
activity represented by a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26. 28. 30. 32. 34, 36, 38. 40. 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 
72*. 74*. 76*. 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 

35 14s'. 15o'. 152. 154, 156. 158. 160. 162, 164, 166, 168, 170. 172, 174. 176, 178, 180, 
182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 
216 218, 220, 222. 224, 226, 228, 230, 232, 234, 236, 238, 240, 242. 244, 246. 248. 
250'. 252*. 254, 256. 258, 260, 262, 264, 266. 268, 270, 272, 274, 276, 278, 280, 282, 
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284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 3.12, 314, 316, 
318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 
352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 
386, 388, 390, 392 or 394refers to a polypeptide having an amino acid sequence 
5 corresponding to the polypeptide of the invention or used in the process of the 

invention, whereas a "non- polypeptide of the invention" or "other polypeptide" refers to 
a polypeptide having an amino acid sequence corresponding to a protein which is not 
substantially homologous a polypeptide of the invention, preferably which is not 
substantially homologous to a polypeptide having the biological activity represented by 

10 a protein as depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78,-80, 82, 
84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 

15 190, 192. 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, .21 6, 218, 220, 222, 
224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 
292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 
326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 

20 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 or 
394, e.g., a protein which does not confer the activity described herein and which is 
derived from the same or a different organism. 

[0324.0.0.0] Within the fusion protein, the term "operatively linked" is intended to 
indicate that the polypeptide of the invention or a polypeptide used in the process of the 

25 invention and the "other polypeptide" or a part thereof are fused to each other so that 
both sequences fulfil the proposed function addicted to the sequence used. The "other 
polypeptide" can be fused to the N-terminus or C-terminus of the polypeptide of the 
invention or used in the process of the invention. For example, in one embodiment the 
fusion protein is a GST-LMRP fusion protein in which the sequences of the polypeptide 

30 of the invention or the polypeptide used in the process of the invention are fused to the 
C-terminus of the GST sequences. Such fusion proteins can facilitate the purification of 
recombinant polypeptides of the invention or a poylpeptide usefull in the process of the 
invention. 

[0325.0.0.0] In another embodiment, the fusion protein is a polypeptide of the 
35 invention or a polypeptide used in the process of the invention containing a 

heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian 
host cells), expression and/or secretion of a polypeptide of the invention or a 
poylpeptide used in the process of the invention can be increased through use of a 
heterologous signal sequence. As already mentioned above, targeting sequences, are 
40 required for targeting the gene product into specific cell compartment (for a review, see 
Kermode, Crit. Rev. Plant Sci. 15, 4 (1996) 285-423 and references cited therein), for 
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example into the vacuole, the nucleus, all types of plastids, such as amyloplasts, 
chloroplasts, chromoplasts, the extracellular space, the mitochondria, the endoplasmic 
reticulum, elaioplasts, peroxisomes, glycosomes, and other compartments of cells or 
extracellular. Sequences, which must be mentioned in this context are, in particular, the 

5 signal-peptide- or transit-peptide-encoding sequences which are known per se. For 
example, plastid-transit-peptide-encoding sequences enable the targeting of the 
expression product into the plastids of a plant cell. Targeting sequences are also 
known for eukaryotic and to a lower extent for prokaryotic organisms and can 
advantageously be operable linked with the nucleic acid molecule of the present 

1 0 invention to achieve an expression in one of said compartments or extracellular. 

[0326.0.0.0] Preferably, a chimeric or fusion protein of the invention is produced by 
standard recombinant DNA techniques. For example, DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with 
conventional techniques, for example by employing blunt-ended or stagger-ended 

1 5 termini for ligation, restriction enzyme digestion to provide for appropriate termini, 
filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. The fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers, which give 

20 rise to complementary overhangs between two consecutive gene fragments which can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, 
for example, Current Protocols in Molecular Biology, eds. Ausubel et al. John Wiley & 
Sons: 1992). Moreover, many expression vectors are commercially available that 
already encode a fusion moiety (e.g., a GST polypeptide). The nucleic acid molecule 

25 of the invention can be cloned into such an expression vector such that the fusion 
moiety is linked in-frame to the encoded protein. 

[0327.0.0.0] Furthermore, folding simulations and computer redesign of structural 
motifs of the protein of the invention can be performed using appropriate computer 
programs (Olszewski, Proteins 25 (1996), 286-299; Hoffman. Comput. Appl. Biosci. 1 1 
30 (1 995), 675-679). Computer modeling of protein folding can be used for the 

conformational and energetic analysis of detailed peptide and protein models (Monge, 
J. Mol. Biol. 247 (1995), 995-1012; Renouf, Adv. Exp. Med. Biol. 376 (1995), 37-45). 
The appropriate programs can be used for the identification of interactive sites the 
polypeptide of the invention or polypeptides used in the process of the invention and its 

35 substrates or binding factors or other interacting proteins by computer assistant 

searches for complementary peptide sequences (Fassina, Immunomethods (1 994). 
1 14-120). Further appropriate computer systems for the design of protein and peptides 
are described in the prior art, for example in Berry, Biochem. Soc. Trans. 22 (1994), 
1033-1036; Wodak, Ann. N. Y. Acad. Sci. 501 (1987), 1-13; Pabo. Biochemistry 25 

40 (1 986), 5987-5991 . The results obtained from the above-described computer analysis 
can be used for, e.g., the preparation of peptidomimetics of the protein of the invention 
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or fragments thereof. Such pseudopeptide analogues of the, natural amino acid 
sequence of the protein may very efficiently mimic the parent protein (Benkirane, J. 
Biol. Chem. 271 (1996), 33218-33224). For example, incorporation of easily available 
achiral Q-amino acid residues into a protein of the invention or a fragment thereof 
5 results in the substitution of amide bonds by polymethylene units of an aliphatic chain, 
thereby providing a convenient strategy for constructing a peptidomimetic (Banerjee, 
Biopolymers 39 (1996), 769-777). 

[0328.0.0.0] Superactive peptidomimetic analogues of small peptide hormones in 
other systems are described in the prior art (Zhang, Biochem. Biophys. Res. Commun. 

10 224 (1996), 327-331). Appropriate peptidomimetics of the protein of the present 
invention can also be identified by the synthesis of peptidomimetic combinatorial 
libraries through successive amide alkylation and testing the resulting compounds, e.g., 
for their binding and immunological properties. Methods for the generation and use of 
peptidomimetic combinatorial libraries are described in the prior art, for example in 

15 Ostresh, Methods in Enzymology 267 (1996), 220-234 and Dorner, Bioorg. Med. 
Chem. 4 (1996), 709-715. 

[0329.0.0.0] Furthermore, a three-dimensional and/or crystallographic structure of 
the protein of the invention can be used for the design of peptidomimetic inhibitors of 
the biological activity of the protein of the invention (Rose, Biochemistry 35 (1996), 
20 12933-12944; Rutenber, Bioorg. Med. Chem. 4 (1996), 1545-1558). 

[0330.0.0.0] Furthermore, a three-dimensional and/or crystallographic structure of 
the protein of the invention and the identification of interactive sites the polypeptide of 
the invention and its substrates or binding factors can be used for design of mutants 
with modulated binding or turn over activities. For example, the active center of the 
25 polypeptide of the present invention can be modelled and amino acid residues 

participating in the catalytic reaction can be modulated to increase or decrease the 
binding of the substrate to activate or improve the polypeptide. The identification of the 
active center and the amino acids involved in the catalytic reaction facilitates the 
screening for mutants having an increased activity. 

30 [0331.0.0.0] One embodiment of the invention also relates to an antibody, which 
binds specifically to the polypeptide according to the invention or parts, i.e. specific 
fragments or epitopes of such a protein. 

[0332.0.0.0] The antibodies of the invention can be used to identify and isolate the 
polypeptide according to the invention and encoding genes in any organism, preferably 
35 plants, prepared in. plants described herein. These antibodies can be monoclonal 
antibodies, polyclonal antibodies or synthetic antibodies as well as fragments of 
antibodies, such as Fab, Fv or scFv fragments etc. Monoclonal antibodies can be 
prepared, for example, by the techniques as originally described in Kohler and Milstein, 
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Nature 256 (1975), 495, and Galfr6, Meth. Enzymol. 73 (1981), 3, which comprise the 
fusion of mouse myeloma cells to spleen cells derived from immunized mammals. 

[0333.0.0.0] Furthermore, antibodies or fragments thereof to the aforementioned 
peptides can be obtained by using methods, which are described, e.g., in Harlow and 

5 Lane "Antibodies, A Laboratory Manual", CSH. Press, Cold Spring Harbor, 1 988. These 
antibodies can be used, for example, for the immunoprecipitation and 
immunolocalization of proteins according to the invention as well as for the monitoring 
of the synthesis of such proteins, for example, in recombinant organisms, and for the 
identification of compounds interacting with the protein according to the invention. For 

1 0 example, surface plasmon resonance as employed in the BIAcore system can be used 
to increase the efficiency of phage antibodies selections, yielding a high increment of 
affinity from a single library of phage antibodies, which bind to an epitope of the protein 
of the invention (Schier, Human Antibodies Hybridomas 7 (1996), 97-105; Malmborg, J. 
Immunol. Methods 183 (1995), 7-13). In many cases, the binding phenomena of 

15 antibodies to antigens are equivalent to other ligand/anti-ligand binding. 

[0334.0.0.0] In one embodiment, the present invention relates to an antisense 
nucleic acid molecule comprising the complementary sequence of the nucleic acid 
molecule of the present invention. 

[0335.0.0.0] Methods to modify the expression levels and/or the activity are known to 
20 persons skilled in the art and include for instance overexpression, co-suppression, the 
use of ribozymes, sense and anti-sense strategies or other gene silencing approaches 
like RNA interference (RNAi) or promoter methylation."Sense strand" refers to the 
strand of a double-stranded DNA molecule that is homologous to an mRNA transcript 
thereof. The "anti-sense strand" contains an inverted sequence, which is 
25 complementary to that of the "sense strand". 

In addition the expression levels and/or the activity can be modified by the introduction 
of mutations in the regulatory or coding regions of the nucleic acids of the invention. 
Furthermore antibodies can be expressed which specifically binds to a polypeptide of 
interest and thereby blocks it acitivity. The protein-binding factors can, for example, 
30 also be aptamers [Famulok M and Mayer G (1 999) Curr. Top Microbiol. Immunol. 243: 
123-36] or antibodies or antibody fragments or single-chain antibodies. Obtaining these 
factors has been described, and the skilled worker is familiar therewith. For example, a 
cytoplasmic scFv antibody has been employed for modulating activity of the 
phytochrome A protein in genetically modified tobacco plants [Owen M et al. (1992) 
35 Biotechnology (NY) 10(7): 790-794; Franken E et al. (1 997) Curr. Opin. Biotechnol. 
8(4): 411-416; Whitelam (1996) Trend Plant Sci. 1: 286-272]. 

[0336.0.0.0] An "antisense" nucleic acid molecule comprises a nucleotide sequence, 
which is complementary to a "sense" nucleic acid molecule encoding a protein, e.g., 
complementary to the coding strand of a double-stranded cDNA molecule or 
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complementary to an encoding mRNA sequence. Accordingly, an antisense nucleic 
acid molecule can bond via hydrogen bonds to a sense nucleic acid molecule. The 
antisense nucleic acid molecule can be complementary to an entire coding strand of a 
nucleic acid molecule conferring the expression of the polypeptide of the invention or 
5 used in the process of the present invention, as the nucleic acid molecule of the 

invention coding strand, or to only a portion thereof- Accordingly, an antisense nucleic 
acid molecule can be antisense to a "coding region" of the coding strand of a 
nucleotide sequence of a nucleic acid molecule of the present invention. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons, 

10 which are translated into amino acid residues. Further, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide 
sequence encoding the polypeptide of the invention or a polypeptide used In the 
process of the invention. The term "noncoding region" refers to 5' and 3' sequences 
which flank the coding region that are not translated into a polypeptide, i.e., also 

15 referred to as 5' and 3' untranslated regions (5'-UTR or 3'-UTR). 

[0337.0.0.0] Given the coding strand sequences encoding the polypeptide of the 
present invention antisense nucleic acid molecules of the invention can be designed 
according to the rules of Watson and Crick base pairing. 

[0338.0.0.0] The antisense nucleic acid molecule can be complementary to the entire 

20 coding region of the mRNA encoding the nucleic acid molecule ot the invention or used 
in the process of the present invention, but can also be an oligonucleotide which is 
antisense to only a portion of the coding or noncoding region of said mRNA. For 
example, the antisense oligonucleotide can be complementary to the region 
surrounding the translation start site of said mRNA. An antisense oligonucleotide can 

25 be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100 or 200 nucleotides in 
length. An antisense nucleic acid molecule of the invention can be constructed using 
chemical synthesis and enzymatic ligation reactions using procedures known in the art. 
For example, an antisense nucleic acid molecule (e.g., an antisense oligonucleotide) 
can be chemically synthesized using naturally occurring nucleotides or variously 

30 modified nucleotides designed to increase the biological stability of the molecules or to 
increase the physical stability of the duplex formed between the antisense and sense 
nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides 
can be used. Examples of modified nucleotides which can be used to generate the 
antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 

35 hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine,:2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 

40 methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 

mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
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isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouraci!, queosine, 
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracH, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3- 
amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
5 antisense nucleic acid can be produced biologically using an expression vector into 
which a nucleic acid molecule has been subdoned in an antisense orientation (i.e., 
RNA transcribed from the inserted nucleic acid molecule will be of an antisense 
orientation to a target nucleic acid molecule of interest, described further in the 
following subsection). 

10 [0339.0.0.0] The antisense nucleic acid molecules of the invention are typically 
administered to a cell or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding a polypeptide of the invention having 
aforementioned the fine chemical increasing activity to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. 

15 [0340.0.0.0] The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule 
which binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. The antisense nucleic acid molecule can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of the 

20 antisense molecules, vector in which the antisense nucleic acid molecule is placed 
under the control of a strong prokaryotic, viral, or eukaryotic including plant promoters 
are preferred. 

[0341 .0.0.0] In a further embodiment, the antisense nucleic acid molecule of the 
invention can be an cc-anomeric nucleic acid molecule. A a-anoraeric nucleic acid 
25 molecule forms specific double-stranded hybrids with complementary RNA in which, 
contrary to the usual units, the strands run parallel to each other (Gaultier et al. (1987) 
Nucleic Acids. Res. 1 5:6625-6641 ). The antisense nucleic acid molecule can also 
comprise a 2'-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131- 
6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330). 

30 [0342.0.0.0] Further the antisense nucleic acid molecule of the invention can be also 
a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity, which 
are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which 
they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff and Geriach (1988) Nature 334:585-591)) can be used to 

35 catalytically cleave mRNA transcripts encoding the polypeptide of the invention to 
thereby inhibit translation of said mRNA. A ribozyme having specificity for a nucleic 
acid molecule encoding the polypeptide of the invention or used in the process of the 
invention can be designed based upon the nucleotide sequence of the nucleic acid 
molecule of the invention or coding a protein used in the process of the invention or on 
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the basis of a heterologous sequence to be isolated according to methods taught in this 
invention. For example, a derivative of a Tetrahymena L-19 IVS RNA can be 
constructed in which the nucleotide sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in an encoding mRNA. See, e.g., Cech et al. U.S. 
5 Patent No. 4,987,071 and Cech et al. U.S. Patent No. 5,1 16,742. Alternatively, mRNA 
encoding the polypeptide of the invention or a polypeptide used in the process of the 
invention can be used to select a catalytic RNA having a specific ribonuclease activity 
from a pool of RNA molecules. See, e.g., Bartel, D. and Szostak, J.W. (1993) Science 
261:1411-1418. 

1 0 [0343.0.0.0] The antisense molecule of the present invention comprises also a 
nucleic acid molecule comprising a nucleotide sequences complementary to the 
regulatory region of an nucleotide sequence encoding the natural occurring polypeptide 
of the invention, e.g. the polypeptide sequences shown in the sequence listing, or 
identified according to the methods described herein, e.g., its promoter and/or 

1 5 enhancers, e.g. to form triple helical structures that prevent transcription of the gene in 
target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6(6): 569-84; 
Helene, C. et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J. (1992) 
Bioassays 14(1 2):807-1 5. 

[0344.0.0.0] Furthermore the present invention relates to a double stranded RNA 
20 molecule capable for the reduction or inhibition of the activity of the gene product of a 
gene encoding the polypeptide of the invention, a polypeptide used in the process of 
the invention, the nucleic acid molecule of the invention or a nucleic acid molecule 
used in the process of the invention encoding. 

[0345.0.0.0] The method of regulating genes by means of double-stranded RNA 
25 ("double-stranded RNA interference"; dsRNAi) has been described extensively for 
animal, yeast, fungi and plant organisms such as Neurospora, zebrafish, Drosophila, 
mice, planaria, humans, Trypanosoma, petunia or Arabidopsis (for example Matzke MA 
et al. (2000) Plant Mol. Biol. 43: 401-415; Fire A. et al. (1998) Nature 391: 806-811; 
WO 99/32619; WO 99/53050; WO 00/68374; WO 00/44914; WO 00/44895; WO 
30 00/49035; WO 00/63364). In addition RNAi is also documented as an advantageously 
tool for the repression of genes in bacteria such as E. coli for example by Tchurikov et 
al. [J. Biol. Ghem., 2000, 275 (34): 26523 - 26529]. Fire et al. named the phenomenon 
RNAi for "RNA interference\ The techniques and methods described in the above 
references are expressly referred to. Efficient gene suppression can also be observed 
35 in the case of transient expression or following transient transformation, for example as 
the consequence of a biolistic transformation (Schweizer P et al. (2000) Plant J 2000 
24: 895-903); dsRNAi methods are based on the phenomenon that the simultaneous 
introduction of complementary strand and counterstrand of a gene transcript brings 
about highly effective suppression of the expression of the gene in question. The 
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resulting phenotype is very similar to that of an analogous knock-out mutant 
(Waterhouse PM et al. (1998) Proc. Natl. Acad. Sci. USA 95: 13959-64). 

[0346.0.0.0] Tuschl et al. [Gens Dev., 1999, 13 (24): 3191 - 3197] was able to 
show that the efficiency of the RNAi method is a function of the length of the duplex, 

5 the length of the 3'-end overhangs, and the sequence in these overhangs. Based on 
the work of Tuschl et al. the following guidelines can be given to the skilled worker. To 
achieve good results the 5' and 3* untranslated regions of the used nucleic acid 
sequence and regions close to the start codon should be avoided as this regions are 
richer in regulatory protein binding sites and interactions between RNAi sequences and 

10 such regulatory proteins might lead to undesired interactions. Preferably a region of the 
used mRNA is selected, which is 50 to 1 00 nt (= nucleotides or bases) downstream of 
the AUG start codon. Only dsRNA (= double-stranded RNA) sequences from exons are 
useful for the method, as sequences from introns have no effect. The G/C content in 
this region should be greater than 30% and less than 70% ideally around 50%. A 

1 5 possible secondary structure of the target mRNA is less important for the effect of the 
RNAi method. 

[0347.0.0.0] The dsRNAi method has proved to be particularly effective.and 
advantageous for reducing the expression of the nucleic acid sequences of the SEQ ID 
NO: 1, 3, 5, 7, 9. 11, 13, 15, 17, .19, 21, 23, 25, 27, 29, 31, 33, 35, 37. 39, 41 43, 45, 
20 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81. 83, 85. 87, 89, 91. 93, 95, 97, 99, 
101, 103, 105, 107, 109, 111. 113, 115, 117, 119, 121, 123, 125. 127, 129. 131. 133. 
135 137. 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 
169 171. 173. 175. 177. 179. 181, 183, 185, 187, 189, 191, 193, 195, 197, 199. 201, 
203 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 
25 237 239, 241 , 243, 245, 247. 249. 251 . 253, 255, 257, 259, 261 , 263, 265. 267, 269, 
27l" 273, 275, 277, 279. 281. 283, 285, 287, 289, 291, 293, 295, 297. 299, 301, 303, 
305 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 
339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361. 363. 365. 367. 369. 371. 
373. 375, 377, 379, 381, 383, 385, 387, 389, 391 or 393 and/or homologs thereof. As 
30 described inter alia, in WO 99/3261 9, dsRNAi approaches are clearly superior to 
traditional antisense approaches. The invention therefore furthermore relates to 
double-stranded RNA molecules (dsRNA molecules) which, when introduced into an 
organism, advantageously into a plant (or a cell, tissue, organ or seed derived 
therefrom), bring about altered metabolic activity by the reduction in the expression of 
35 the nucleic acid sequences of the SEQ ID NO: 1, 3, 5, 7, 9, 11. 13, 15, 17, 19, 21, 23, 
25, 27. 29. 31, 33, 35. 37, 39. 41 43. 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79*. 81, 83. 85. 87, 89, 91, 93, 95, 97, 99, 101, 103. 105, 107. 109. 111, 113, 115, 117, 
119 121. 123. 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145. 147, 149, 151, 
153*, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173. 175, 177, 179, 181, 183. 185, 
40 187, 189, 191, 193, 195, 197. 199. 201. 203, 205, 207, 209, 211, 213, 215, 217, 219, 
221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245. 247. 249, 251, 253, 
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255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 
289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 
323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 
357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 
5 391 or 393 and/or homologs thereof. In a double-stranded RNA molecule for reducing 
the expression of an protein encoded by a nucleic acid sequence of one of the SEQ ID 
NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43. 45, 
55, 57, 59, 61, 63, 65. 67, 69, 71. 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97. 99, 
101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 

10 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 
169, 171, 173, 175, 177. 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 
203, 205, 207, 209, 21 1, 213. 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 
237, 239, 241, 243, 245. 247, 249. 251. 253, 255, 257, 259, 261, 263, 265, 267, 269, 
271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 

15 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327. 329, 331, 333, 335, 337, 
339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 
373, 375, 377, 379, 381, 383, 385, 387, 389, 391 or 393 and/or homologs thereof, one 
of the two RNA strands is essentially identical to at least part of a nucleic acid 
sequence, and the respective other RNA strand is essentially identical to at least part 

20 of the complementary strand of a nucleic acid sequence. 

[0348.0.0.0] The term "essentially identical" refers to the fact that the dsRNA 
sequence may also include insertions, deletions and individual point mutations in 
comparison to the target sequence while still bringing about an effective reduction in 
expression. Preferably, the homology as defined above amounts to at least 30%, 

25 preferably at least 40%, 50%, 60%, 70% or 80%, very especially preferably at least 
90%, most preferably 100%, between the "sense" strand of an inhibitory dsRNA and a 
part-segment of a nucleic acid sequence of the invention (or between the "antisense" 
strand and the complementary strand of a nucleic acid sequence, respectively). The 
part-segment amounts to at least 10 bases, preferably at least 17, 18, 19, 20, 21, 22, 

30 23. 24, 25, 26, 27, 28, 29 or 30 bases, especially preferably at least 40, 50, 60, 70, 80 
or 90 bases, very especially preferably at least 100, 200, 300 or 400 bases, most 
preferably at least 500, 600, 700, 800. 900 or more bases or at least 1 000 or 2000 
bases or more in length. In another preferred embodiment of the invention the part- 
segment amounts to 17, 18, 19. 20, 21, 22, 23, 24, 25, 26 or 27 bases, preferably to 

35 20, 21, 22, 23, 24 or 25 bases. These short sequences are preferred in animals and 
plants. The longer sequences preferably between 200 and 800 bases are preferred in 
nonmammalian animals, preferably in invertebrates, in yeast, fungi or bacteria, but they 
are also useable in plants. Long double-stranded RNAs are processed in the 
organisms into many siRNAs (= small/short interfering RNAs) for example by the 

40 protein Dicer, which is a ds-specific Rnase III enzyme. As an alternative, an "essentially 
identical" dsRNA may also be defined as a nucleic acid sequence, which is capable of 
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hybridizing with part of a gene transcript (for example in 400 mM NaCI, 40 mM PIPES 
pH 6.4, 1 mM EDTA at 50°C or 70°C for 12 to 16 h). 

[0349.0.0.0] The dsRNA may consist of one or more strands of polymerized 
ribonucleotides. Modification of both the sugar-phosphate backbone and of the 
5 nucleosides may furthermore be present. For example, the phosphodiester bonds of 
the natural RNA can be modified in such a way that they encompass at least one 
nitrogen or sulfur heteroatom. Bases may undergo modification in such a way that the 
activity of, for example, adenosine deaminase is restricted. These and other 
modifications are described herein below in the methods for stabilizing antisense RNA. 

10 [0350.0.0.0] The dsRNA can be prepared enzymatically; it may also be synthesized 
chemically, either in full or in part. 

[0351.0.0.0] The double-stranded structure can be formed starting from a single, self- 
complementary strand or starting from two complementary strands. In a single, self- 
complementary strand, "sense" and "antisense" sequence can be linked by a linking 

1 5 sequence ("linker") and form for example a hairpin structure. Preferably, the linking 
sequence may take the form of an intron, which is spliced out following dsRNA 
synthesis. The nucleic acid sequence encoding a dsRNA may contain further elements 
such as, for example, transcription termination signals or polyadenylation signals. If the 
two strands of the dsRNA are to be combined in a cell or an organism advantageously 

20 in a plant, this can be brought about in a variety of ways. 

[0352.0.0.0] Formation of the RNA duplex can be initiated either outside the cell or 
within the cell. As shown in WO 99/53050, the dsRNA may also encompass a hairpin 
structure, by linking the "sense" and "antisense" strands by a "linker" (for example an 
intron). The self-complementary dsRNA structures are preferred since they merely 
25 require the expression of a construct and always encompass the complementary 
strands in an equimolar ratio. 

[0353.0.0.0] The expression cassettes encoding the "antisense" or the "sense" strand 
of the dsRNA or the self-complementary strand of the dsRNA are preferably inserted 
into a vector and stably inserted into the genome of a plant, using the methods 
30 described herein below (for example using selection markers), in order to ensure 
permanent expression of the dsRNA. 

[0354.0.0.0] The dsRNA can be introduced using an amount which makes possible 
at least one copy per cell. A larger amount (for example at least 5, 10, 100, 500 or 
1 000 copies per cell) may bring about more efficient reduction. 

35 [0355.0.0.0] As has already been described, 1 00 % sequence identity between the 
dsRNA and a gene transcript of a nucleic acid sequence of one of the SEQ ID NO: 1, 
3, 5, 7, 9, 1 1, 13, 15, 17. 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 
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59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 
103, 105, 107. 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 
171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201. 203, 
5 205, 207, 209, 211. 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235. 237, 
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 
307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 
341 , 343, 345, 347, 349, 351 , 353, 355, 357, 359, 361 , 363, 365, 367, 369, 371 , 373, 

10 375, 377, 379, 381 , 383, 385, 387, 389, 391or 393 or it's homolog is not necessarily 
. . required in order to bring about effective reduction in the expression. The advantage is, 
accordingly, that the method is tolerant with regard to sequence deviations as may be 
present as a consequence of genetic mutations, polymorphisms or evolutionary 
divergences. Thus, for example, using the dsRNA, which has been generated starting 

1 5 from a sequence of one of SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 21 , 23, 25, 27, 
29, 31, 33, 35. 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 
83. 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155. 157, 159, 161. 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 

20 189, 191, 193, 195. 197, 199, 201, 203, 205, 207, 209, 211, 213, 215.217, 219, 221, 
223, 225. 227, 229, 231, 233, 235, 237. 239, 241, 243, 245, 247, 249, 251, 253, 255, 
257, 259, 261 , 263, 265, 267, 269, 271 , 273, 275, 277, 279, 281 , 283, 285, 287, 289, 
291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 
325. 327, 329, 331 , 333, 335, 337, 339, 341 , 343, 345, 347, 349, 351 , 353, 355, 357, 

25 359, 361 , 363, 365, 367, 369, 371 , 373, 375, 377, 379, 381 , 383, 385, 387, 389, 391 or 
393 or homologs thereof of the one organism, may be used to suppress the 
corresponding expression in another organism. 

[0356.0.0.0] Due to the high degree of sequence homology between sequences from 
various organisms (e. g. plants), allows the conclusion that these proteins may be 
30 conserved to a high degree within, for example other, plants, it is optionally possible 
that the expression of a dsRNA derived from one of the disclosed sequences as shown 
herein or homologs thereof should also have has an advantageous effect in other plant 
species. Preferably the consensus sequences shown herein can be used for the 
construction of useful dsRNA molecules. 

35 [0357.0.0.0] The dsRNA can be synthesized either in vivo or in vitro. To this end, a 
DNA sequence encoding a dsRNA can be introduced into an expression cassette 
under the control of at least one genetic control element (such as, for example, 
promoter, enhancer, silencer, splice donor or splice acceptor or polyadenylation 
signal). Suitable advantageous constructs are described herein below. Polyadenylation 

40 is not required, nor do elements for initiating translation have to be present. 
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[0358.0.0.0] A dsRNA can be synthesized chemically or enzymatically. Cellular RNA 
polymerases or bacteriophage RNA polymerases (such as, for example T3, T7 or SP6 
RNA polymerase) can be used for this purpose. Suitable methods for the in-vitro 
expression of RNA are described (WO 97/3201 6; US 5.593,874; US 5,698,425, 

5 US 5,712,135, US 5,789,214, US 5,804,693). Prior to introduction into a cell, tissue or 
organism, a dsRNA which has been synthesized in vitro either chemically or 
enzymatically can be isolated to a higher or lesser degree from the reaction mixture, for 
example by extraction, precipitation, electrophoresis, chromatography or combinations 
of these methods. The dsRNA can be introduced directly into the cell or else be applied 

10 extracellularly (for example into the interstitial space). 

[0359.0.0.0] Advantageously the RNAi method leads to only a partial loss of gene 
function and therefore enables the skilled worker to study a gene dose effect in the 
disered organism and to fine tune the process of the invention. Futhermore it enables a 
person skilled in the art to study multiple functions of a gene. 

1 5 [0360.0.0.0] Stable transformation of the plant with an expression construct, which 
brings about the expression of the dsRNA is preferred, however. Suitable methods are 
described herein below. 

[0361 .0.0.0] A further embodiment of the invention also relates to a method for the 
generation of a transgenic host or host cell, e.g. a eukaryotic or prokaryotic cell, 
20 preferably a transgenic microorganism, a transgenic plant cell or a transgenic plant 
tissue or a transgenic plant, which comprises introducing, into the plant, the plant cell 
or the plant tissue, the nucleic acid construct according to the invention, the vector 
according to the invention, or the nucleic acid molecule according to the invention. 

[0362.0.0.0] A further embodiment of the invention also relates to a method for the 
25 transient generation of a host or host cell, eukaryotic or prokaryotic cell, preferably a 
transgenic microorganism, a transgenic plant cell or a transgenic plant tissue or a 
transgenic plant, which comprises introducing, into the plant, the plant cell or the plant 
tissue, the nucleic acid construct according to the invention, the vector according to the 
invention, the nucleic acid molecule characterized herein as being contained in the 
30 nucleic acid construct of the invention or the nucleic acid molecule according to the 

invention, whereby the introduced nucleic acid molecules, nucleic acid construct and/or 
vector is not integrated into the genome of the host or host cell. Therefore the 
transformants are not stable during the propagation of the host in respect of the 
introduced nucleic acid molecules, nucleic acid construct and/or vector. 

35 [0363.0.0.0] In the process according to the invention, transgenic organisms are also 
to be understood as meaning - if they take the form of plants - plant cells, plant tissues, 
plant organs such as root, shoot, stem, seed, flower, tuber or leaf, or intact plants 
which are grown for the production of the fine chemical. 
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[0364.0.0.0] Growing is to be understood as meaning for example culturing the 
transgenic plant cells, plant tissue or plant organs on or in a nutrient medium or the 
intact plant on or in a substrate, for example in hydroponic culture, potting compost or 
on a field soil. 

[0365.0.0.0] In a further advantageous embodiment of the process, the nucleic acid 
molecules can be expressed in single-celled plant cells (such as algae), see Falciatore 
et al., 1999, Marine Biotechnology 1 (3): 239-251 and references cited therein, and 
plant cells from higher plants (for example spermatophytes such as crops). Examples 
of plant expression vectors encompass those which are described in detail herein or in: 
Becker, D. [(1992) Plant Mol. Biol. 20:1195-1197] and Bevan, M.W. [(1984), Nucl. 
Acids Res. 12:871 1-8721 ; Vectors for Gene Transfer in Higher Plants; in: Transgenic 
Plants, Vol. 1, Engineering and Utilization, Ed.: Kung and R. Wu, Academic Press, 
1993, pp. 15-38]. An overview of binary vectors and their use is also found in Hellens, 
R. [(2000), Trends in Plant Science, Vol. 5 No. 1 0, 446-451 . 

[0366.0.0.0] Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. The terms "transformation" and 
"transfection" include conjugation and transduction and, as used in the present context, 
are intended to encompass a multiplicity of prior-art methods for introducing foreign 
nucleic acid molecules (for example DNA) into a host cell, including calcium phosphate 
coprecipitation or calcium chloride coprecipitation, DEAE-dextran-mediated 
transfection, PEG-mediated transfection, lipofection, natural competence, chemically 
mediated transfer, electroporation or particle bombardment. Suitable methods for the 
transformation or transfection of host cells, including plant cells, can be found in 
Sambrook et al. (Molecular Cloning: A Laboratory Manual., 2nd Ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989) and in other laboratory handbooks such as Methods in Molecular Biology, 1995, 
Vol. 44, Agrobacterium protocols, Ed.: Gartland and Davey, Humana Press, Totowa, 
New Jersey. 

[0367.0.0.0] The above-described methods for the transformation and regeneration 
of plants from plant tissues or plant cells are exploited for transient or stable 
transformation of plants. Suitable methods are the transformation of protoplasts by 
polyethylene-giycol-induced DNA uptake, the biolistic method with the gene gun - 
known as the particle bombardment method -, electroporation, the incubation of dry 
embryos in DNA-containing solution, microinjection and the Agrobacterium-mediated 
gene transfer. The abovementioned methods are described for example in B. Jenes, 
Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, Engineering 
and Utilization, edited by S.D. Kung and R. Wu, Academic Press (1993) 128-143 and in 
Potrykus Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991) 205-225. The construct 
to be expressed is preferably cloned into a vector, which is suitable for transforming 
Agrobacterium tumefaciens, for example pBin19 (Bevan, Nucl. Acids Res. 12 (1984) 
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871 1). Agrobacteria transformed with such a vector can then be used in the known 
manner for the transformation of plants, in particular crop plants, such as, for example, 
tobacco plants, for example by bathing scarified leaves or leaf segments in an 
agrobacterial solution and subsequently culturing them in suitable media. The 
5 transformation of plants with Agrobacterium tumefaciens is described for example by 
uAfnan ™A wiiirrViWor in Mi iH AHH ( 1 Q&8\ 1fi QR77 or known from, inter alia. 

I IWI^WI I Gil IWi * V mi ■ ii»*»vi ■■ ■ « vmwi. § iwim ■ »w~w. y • — '—l ™" -* « 

F.F. White, Vectors for Gene Transfer in Higher Plants; in Transgenic Plants, Vol. 1, 
Engineering and Utilization, edited by S.D. Kung and R. Wu, Academic Press, 1993, 
pp. 15-38. 

10 [0368.0.0.0] To select for the successful transfer of the nucleic acid molecule, vector 
or nucleic acid construct of the invention according to the invention into a host 
organism, it is advantageous to use marker genes as have already been described 
above in detail. It is known of the stable or transient integration of nucleic acids into 
plant cells that only a minority of the cells takes up the foreign DNA and, if desired, 

15 integrates it into its genome, depending on the expression vector used and the 

transfection technique used. To identify and select these integrants, a gene encoding 
for a selectable marker (as described above, for example resistance to antibiotics) is 
usually introduced into the host cells together with the gene of interest. Preferred 
selectable markers in plants comprise those, which confer resistance to an herbicide 

20 such as glyphosate or gluphosinate. Other suitable markers are, for example, markers, 
which encode genes involved in biosynthetic pathways of, for example, sugars or 
amino acids, such as (J-galactosidase, ura3 or ilv2. Markers, which encode genes such 
as luciferase, gfp or other fluorescence genes, are likewise suitable. These markers 
and the aforementioned markers can be used in mutants in whom these genes are not 

25 functional since, for example, they have been deleted by conventional methods. 
Furthermore, nucleic acid molecules, which encode a selectable marker, can be 
introduced into a host cell on the same vector as those, which encode the polypeptides 
of the invention or used in the process or else in a separate vector. Cells which have 
been transfected stably with the nucleic acid introduced can be identified for example 

30 by selection (for example, cells which have integrated the selectable marker survive 
whereas the other cells die). 

[0369.0.0.0] Since the marker genes, as a rule specifically the gene for resistance to 
antibiotics and herbicides, are no longer required or are undesired in the transgenic 
host cell once the nucleic acids have been introduced successfully, the process 

35 according to the invention for introducing the nucleic acids advantageously employs 
techniques which enable the removal, or excision, of these marker genes. One such a 
method is what is known as cotransformation. The cotransformation method employs 
two vectors simultaneously for the transformation, one vector bearing the nucleic acid 
according to the invention and a second bearing the marker gene(s). A large proportion 

40 of transformants receives or, in the case of plants, comprises (up to 40% of the 

transformants and above), both vectors. In case of transformation with Agrobacteria, 
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the transformants usually receive only a part of the vector, the sequence flanked by the 
T-DNA which usually represents the expression cassette. The marker genes can 
subsequently be removed from the transformed plant by performing crosses. In another 
method, marker genes integrated into a transposon are used for the transformation 
together with desired nucleic acid (known as the Ac/Ds technology). The transformants 
can be crossed with a transposase resource or the transformants are transformed with 
a nucleic acid construct conferring expression of a transposase, transiently or stable. In 
some cases (approx. 10%), the transposon jumps out of the genome of the host cell 
once transformation has taken place successfully and is lost. In a further number of 
cases, the transposon jumps to a different location. In these cases, the marker gene 
must be eliminated by performing crosses. In microbiology, techniques were developed 
which make possible, or facilitate, the detection of such events. A further advantageous 
method relies on what are known as recombination systems, whose advantage is that 
elimination by crossing can be dispensed with. The best-known system of this type is 
what is known as the Cre/lox system. Cre1 is a recombinase, which removes the 
sequences located between the loxP sequences. If the marker gene is integrated 
between the loxP sequences, it is removed, once transformation has taken place 
successfully, by expression of the recombinase. Further recombination systems are the 
HIN/HIX, FLP/FRT and REP/STB system" (Tribble et al., J. Biol. Chem., 275, 2000: 
22255-22267; Velmurugan et al., J. Cell Biol., 149, 2000: 553-566). A site-specific 
integration into the plant genome of the nucleic acid sequences according to the 
invention is possible. Naturally, these methods can also be applied to microorganisms 
such as yeast, fungi or bacteria. 

[0370.0.0.0] Agrobacteria transformed with an expression vector according to the 
invention may also be used in the manner known per se for the transformation of plants 
such as experimental plants like Arabidopsis or crop plants, such as, for example, 
cereals, maize, oats, rye, barley, wheat, soya, rice, cotton, sugarbeet, canola, 
sunflower, flax, hemp, potato, tobacco, tomato, carrot, bell peppers, oilseed rape, 
tapioca, cassava, arrow root, tagetes, alfalfa, lettuce and the various tree, nut, and 
grapevine species, in particular oil-containing crop plants such as soya, peanut, castor- 
oil plant, sunflower, maize, cotton, flax, oilseed rape, coconut, oil palm, safflower 
(Carthamus tinctorius) or cocoa beans, for example by bathing scarified leaves or leaf 
segments in an agrobacterial solution and subsequently growing them in suitable 
media. 

[0371.0.0.0] In addition to the transformation of somatic cells, which then has to be 
regenerated into intact plants, it is also possible to transform the cells of plant 
meristems and in particular those cells which develop into gametes. In this case, the 
transformed gametes follow the natural plant development, giving rise to transgenic 
plants. Thus, for example, seeds of Arabidopsis are treated with agrobacteria and 
seeds are obtained from the developing plants of which a certain proportion is 
transformed and thus transgenic (Feldman, KA and Marks MD (1987). Mol Gen Genet 
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208:274-289; Feldmann K (1992). In: C Koncz, N-H Chua and J Shell, eds, Methods in 
Arabidopsis Research. Word Scientific, Singapore, pp. 274-289). Alternative methods 
are based on the repeated removal of the influorescences and incubation of the 
excision site in the center of the rosette with transformed agrobacteria, whereby 

5 transformed seeds can likewise be obtained at a later point in time (Chang (1994). 
Plant J. 5: 551-558; Katavic (1994). Mol Gen Genet, 245: 363-370). However, an 
especially effective method is the vacuum infiltration method with its modifications such 
as the "floral dip" method. In the case of vacuum infiltration of Arabidopsis, intact plants 
un( j er reduced pressure are treated with an agrobacteria! suspension (Bechthold, N 

10 (1993). C R Acad Sci Paris Life Sci, 316: 1 194-1 199), while in the case of the n floral dip" 
method the developing floral tissue is incubated briefly with a surfactant-treated 
agrobacterial suspension (Clough, SJ und Bent, AF (1998). The Plant J. 16, 735-743). 
A certain proportion of transgenic seeds are harvested in both cases, and these seeds 
can be distinguished from nontransgenic seeds by growing under the above-described 

15 selective conditions. In addition the stable transformation of plastids is of advantages 
because plastids are inherited maternally is most crops reducing or eliminating the risk 
of transgene flow through pollen. The transformation of the chloroplast genome is 
generally achieved by a process, which has been schematically displayed in Klaus et 
aL, 2004 (Nature Biotechnology 22(2), 225-229). Briefly the sequences to be 

20 transformed are cloned together with a selectable marker gene between flanking 
sequences homologous to the chloroplast genome. These homologous flanking 
sequences direct site-specific integration into the plastome. Plastidal transformation 
has been described for many different plant species and an overview can be taken 
from Bock (2001) Transgenic plastids in basic research and plant biotechnology. J Mol 

25 Biol. 2001 Sep 21; 312(3): 425-38 or Maliga, P (2003) Progress towards 

commercialization of plastid transformation technology. Trends Biotechnol. 21, 20-28. 
Further biotechnological progress has recently been reported in form of marker free 
plastid transformants, which can be produced by a transient cointegrated maker gene 
(Klaus et aL, 2004, Nature Biotechnology 22(2), 225-229). 

30 [0372.0.0.0] The genetically modified plant cells can be regenerated via all methods 
with which the skilled worker is familiar. Suitable methods can be found in the 
abovementioned publications of S.D. Kung and R. Wu, Potrykus or Hofgen and 
Willmitzer. 

[0373.0.0.0] Accordingly, the present invention thus also relates to a plant cell 
35 comprising the nucleic acid construct according to the invention, the nucleic acid 
molecule according to the invention or the vector according to the invention. 

[0374.0.0.0] Accordingly the present invention relates to any cell transgenic for any 
nucleic acid characterized as part of the invention, e.g. conferring the increase of the 
fine chemical in a cell or an organism or a part thereof, e.g. the nucleic acid molecule of 
40 the invention, the nucleic acid construct of the invention, the antisense molecule of the 
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invention, the vector of the invention or a nucleic acid molecule encoding the 
polypeptide of the invention, e.g. encoding a polypeptide having the biological activity 
of the protein of the invention. Due to the above mentioned activity the fine chemical 
content in a cell or an organism is increased. For example, due to modulation or 
5 manupulation, the cellular activity is increased, e.g. due to an increased expression or 
specific activity of the subject matters of the invention in a cell or an organism or a part 
thereof. Transgenic for a polypeptide having biological activity of the protein of the 
invention or activity means herein that due to modulation or manipulation of the 
genome, the biological activity of the protein of the invention is increased in the cell or 
10 organism or part thereof. Examples are described above in context with the process of 
. the invention 

[0375.0.0.0] Transgenic", for example regarding a nucleic acid molecule, an nucleic 
add construct or a vector comprising said nucleic acid molecule or an organism 
transformed with said nucleic acid molecule, nucleic acid construct or vector, refers to 
15 all those subjects originating by recombinant methods in which either 

a) the nucleic acid sequence, or 

b) a genetic control sequence linked operably to the nucleic acid sequence, for 
example a promoter, or 

c) (a) and (b) 

are not located in their natural genetic environment or have been modified by 
recombinant methods, an example of a modification being a substitution, addition, 
deletion, inversion or insertion of one or more nucleotide residues. Natural genetic 
environment refers to the natural chromosomal locus in the organism of origin, or to the 
presence in a genomic library. In the case of a genomic library, the natural genetic 
environment of the nucleic acid sequence is preferably retained, at least in part. The 
environment flanks the nucleic acid sequence at least at one side and has a sequence 
of at least 50 bp, preferably at least 500 bp, especially preferably at least 1000 bp, very 
especially preferably at least 5000 bp, in length. 

[0376.0.0.0] A naturally occurring expression cassette - for example the naturally 
occurring combination of the promoter of the protein of the invention with the 
corresponding protein gene - becomes a transgenic expression cassette when it is 
modified by non-natural, synthetic "artificial" methods such as, for example, 
mutagenization. Such methods have been described (US 5,565,350; WO 00/15815; 
also see above). 

[0377.0.0.0] Further, the plant cell, plant tissue or plant can also be transformed such 
that further enzymes and proteins are (over)expressed which expression supports an 
increase of the fine chemical. 
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[0378.0.0.0] However, transgenic also means that the nucleic acids according to the 
invention are located at their natural position in the genome of an organism, but that 
the sequence has been modified in comparison with the natural sequence and/or that 
the regulatory sequences of the natural sequences have been modified. Preferably, 
5 transgenic/recombinant is to be understood as meaning the transcription of the nucleic 
acids used in the process according to the invention occurs at a non-natural position in 
the genome, that is to say the expression of the nucleic acids is homologous or, 
preferably, heterologous. This expression can be transiently or of a sequence 
integrated stably into the genome. 

1 0 [0379.0.0.0] The term "transgenic plants" used in accordance with the invention also 
refers to the progeny of a transgenic plant, for example the T 1f T 2 , T 3 and subsequent 
plant generations or the BC 1f BC 2 , BC 3 and subsequent plant generations. Thus, the 
transgenic plants according to the invention can be raised and selfed or crossed with 
other individuals in order to obtain further transgenic plants according to the invention. 

15 Transgenic plants may also be obtained by propagating transgenic plant cells 

vegetatively. The present invention also relates to transgenic plant material, which can 
be derived from a transgenic plant population according to the invention. Such material 
includes plant cells and certain tissues, organs and parts of plants in all their 
manifestations, such as seeds, leaves, anthers, fibers, tubers, roots, root hairs, stems, 

20 embryo, calli, cotelydons, petioles, harvested material, plant tissue, reproductive tissue 
and cell cultures, which are derived from the actual transgenic plant and/or can be 
used for bringing about the transgenic plant. 

[0380.0.0.0] Any transformed plant obtained according to the invention can be used 
in a conventional breeding scheme or in in vitro plant propagation to produce more 

25 transformed plants with the same characteristics and/or can be used to introduce the 
same characteristic in other varieties of the same or related species. Such plants are 
also part of the invention. Seeds obtained from the transformed plants genetically also 
contain the same characteristic and are part of the invention. As mentioned before, the 
present invention is in principle applicable to any plant and crop that can be 

30 transformed with any of the transformation method known to those skilled in the art. 

Another embodiment of the invention is the use of the nucleic acid molecule as claimed 
in above in mapping and breeding processes for the identification of plant varieties 
having and increased capacity for production of the fine chemical. 

[0381.0.0.0] In an especially preferred embodiment, the organism, the host cell, plant 
35 cell, plant, microorganism or plant tissue according to the invention is transgenic. 

[0382.0.0.0] Accordingly, the invention therefore relates to transgenic organisms 
transformed with at least one nucleic acid molecule, nucleic acid construct or vector 
according to the invention, and to cells, cell cultures, tissues, parts - such as, for 
example, in the case of plant organisms, plant tissue, for example leaves, roots and the 
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like - or propagation material derived from such organisms, or intact plants. The terms 
"recombinant (host)", and "transgenic (host)" are used interchangeably in this context. 
Naturally, these terms refer not only to the host organism or target cell in question, but 
also to the progeny, or potential progeny, of these organisms or cells. Since certain 
modifications may occur in subsequent generations owing to mutation or environmental 
effects, such progeny is not necessarily identical with the parental cell, but still comes 
within the scope of the term as used herein. 

[0383.0.0.0] Suitable organisms for the process according to the invention or as 
hosts are all these eukaryotic or prokaryotic organisms, which are capable of 
synthesizing the fine chemcial. The organisms used as hosts are microorganisms, such 
as bacteria, fungi, yeasts or algae, non-human animals, or plants, such as 
dictotyledonous or monocotyledonous plants. 

[0384.0.0.0] In principle all plants can be used as host organism, especially the 
plants mentioned above as source organism. Preferred transgenic plants are, for 
example, selected from the families Aceraceae, Anacardiaceae, Apiaceae, Asteraceae, 
Brassicaceae, Cactaceae, Cucurbitaceae, Euphorbiaceae, Fabaceae, Malvaceae, 
Nymphaeaceae, Papaveraceae, Rosaceae, Salicaceae, Solanaceae, Arecaceae, 
Bromeliaceae, Cyperaceae, Iridaceae, Liliaceae, Orchidaceae, Gentianaceae, Labia- 
ceae, Magnoliaceae, Ranunculaceae, Carifolaceae, Rubiaceae, Scrophulariaceae, 
Caryophyllaceae, Ericaceae, Polygonaceae, Violaceae, Juncaceae or Poaceae and 
preferably from a plant selected from the group of the families Apiaceae, Asteraceae, 
Brassicaceae, Cucurbitaceae, Fabaceae, Papaveraceae, Rosaceae, Solanaceae, . 
Liliaceae or Poaceae. Preferred are crop plants such as plants advantageously 
selected from the group of the genus peanut, oilseed rape, canola, sunflower, 
safflower, olive, sesame, hazelnut, almond, avocado, bay, pumpkin/squash, linseed, 
soya, pistachio, borage, maize, wheat, rye, oats, sorghum and millet, triticale, rice, 
barley, cassava, potato, sugarbeet, egg plant, alfalfa, and perennial grasses and forage 
plants, oil palm, vegetables (brassicas, root vegetables, tuber vegetables, pod 
vegetables, fruiting vegetables, onion vegetables, leafy vegetables and stem 
vegetables), buckwheat, Jerusalem artichoke, broad bean, vetches, lentil, dwarf bean, 
lupin, clover and Lucerne for mentioning only some of them. 

[0385.0.0.0] Preferred plant cells, plant organs, plant tissues or parts of plants 
originate from the under source organism mentioned plant families, preferably from the 
abovementioned plant genus, more preferred from abovementioned plants spezies. 

[0386.0.0.0] Transgenic plants comprising the fine chemical synthesized in the 
process according to the invention can be marketed directly without isolation of the 
compounds synthesized. In the process according to the invention, plants are 
understood as meaning all plant parts, plant organs such as leaf, stalk, root, tubers or 
seeds or propagation material or harvested material or the intact plant. In this context, 
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the seed encompasses all parts of the seed such as the seed coats, epidermal cells, 
seed cells, endosperm or embryonic tissue. The fine chemical produced in the process 
according to the invention may, however, also be isolated from the plant in free or 
bound form. The fine chemical produced by this process can be harvested by 
5 harvesting the organisms either from the culture in which they grow or from the field. 
This can be done via expressing, grinding and/or extraction, salt precipitation and/or 
ion-exchange chromatography of the plant parts, preferably the plant seeds, plant 
fruits, plant tubers and the like. 

[0387.0.0.0] In a further embodiment, the present invention relates to a process for 
10 the generation of a microorganism, comprising the introduction, into the microorganism 
or parts thereof, of the nucleic acid construct of the invention, or the vector of the 
invention or the nucleic acid molecule of the invention. 

[0388.0.0.0] In another embodiment, the present invention relates also to a 
transgenic microorganism comprising the nucleic acid molecule of the invention, the 
1 5 nucleic acid construct of the invention or the vector as of the invention. Appropriate 
microorganisms have been described herein before under source organism, preferred 
are in particular aforementioned strains suitable for the production of fine chemicals. 

[0389.0.0.0] Accordingly, the present invention relates also to a process according to 
the present invention whereby the produced fine chemical or fine chemical composition 
20 is isolated. 

[0390.0.0.0] In this manner, more than 50% by weight, advantageously more than 
60% by weight, preferably more than 70% by weight, especially preferably more than 
80% by weight, very especially preferably more than 90% by weight, of the fine 
chemical produced in the process can be isolated. The resulting fine chemical can, if 
25 appropriate, subsequently be further purified, if desired mixed with other active 

ingredients such as vitamins, amino acids, carbohydrates, antibiotics and the like, and, 
if appropriate, formulated. 

[0391.0.0.0] The fine chemical obtained in the process is suitable as starting material 
for the synthesis of further products of value. For example, they can be used in 

30 combination with other ingredients or alone for the production of pharmaceuticals, 
foodstuffs, animal feeds or cosmetics. Accordingly, the present invention relates a 
method for the production of a pharmaceuticals, food stuff, animal feeds, nutrients or 
cosmetics comprising the steps of the process according to the invention, including the 
isolation of the fine chemical or fine chemical composition produced and if desired 

35 formulating the product with a pharmaceutical acceptable carrier or formulating the 
product in a form acceptable for an application in agriculture. A further embodiment 
according to the invention is the use of the fine chemical produced in the process or of 
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the transgenic organisms in animal feeds, foodstuffs, medicines, food supplements, 
cosmetics or pharmaceuticals. 

[0392.0.0.0] In principle all microorganisms can be used as host organism especially 
the ones mentioned under source organism above. It is advantageous to use in the 
5 process of the invention transgenic microorganisms such as fungi such as the genus 
Claviceps or Aspergillus or Gram-positive bacteria such as the genera Bacillus, 
Corynebacterium, Micrococcus, Brevibacterium, Rhodococcus, Nocardia, Caseobacter 
or Arthrobacter or Gram-negative bacteria such as the genera Escherichia, 
Flavobacterium or Salmonella or yeasts such as the genera Rhodotorula, 
10 Saccharomyces, Hansenula or Candida. Particularly advantageous organisms are 
selected from the group of genera Corynebacterium, Brevibacterium, Escherichia, 
Bacillus, Rhodotorula, Saccharomyces, Hansenula, Candida, Claviceps or 
Flavobacterium. It is very particularly advantageous to use in the process of the 
invention microorganisms selected from the group of genera and species consisting of 
15 Saccharomyces cerevisiae, Hansenula anomala, Candida utilis, Claviceps purpurea, 
Bacillus circulans, Bacillus subtilis, Bacillus sp., Brevibacterium albidum, 
Brevibacterium album, Brevibacterium cerinum, Brevibacterium flavum, Brevibacterium 
glutamigenes, Brevibacterium iodinum, Brevibacterium ketoglutamicum, 
Brevibacterium lactofermentum, Brevibacterium linens, Brevibacterium roseum, 
20 Brevibacterium saccharolyticum, Brevibacterium sp., Corynebacterium 
acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium 
ammoniagenes, Corynebacterium glutamicum (= Micrococcus glutamicum), 
Corynebacterium melassecola, Corynebacterium sp. or Escherichia coli, specifically 
Saccharomyces cerevisiae, Escherichia coli K12 and its described strains. 

25 [0393.0.0.0] The process of the invention is, when the host organisms are 

microorganisms, advantageously carried out at a temperature between 0°C and 95°C, 
preferably between 10°C and 85°C, particularly preferably between 15°C and 75°C, 
very particularly preferably between 15°C and 45°C. The pH is advantageously kept at 
between pH 4 and 12, preferably between pH 6 and 9, particularly preferably between 

30 pH 7 and 8, during this. The process of the invention can be operated batchwise, 

semibatchwise or continuously. A summary of known cultivation methods is to be found 
in the textbook by Chmiel (Bioprozelitechnik 1. Einfuhrung in die Bioverfahrenstechnik 
(Gustav Fischer Verlag, Stuttgart, 1991)) or in the textbook by Storhas (Bioreaktoren 
und periphere Einrichtungen (Vieweg Verlag, Braunschweig/Wiesbaden, 1994)). The 

35 culture medium to be used must meet the requirements of the respective strains in a 
suitable manner. Descriptions of culture media for various microorganisms are present 
in the handbook "Manual of Methods for General Bacteriology" of the American Society 
for Bacteriology (Washington D. C, USA, 1981). These media, which can be employed 
according to the invention include, as described above, usually one or more carbon 

40 sources, nitrogen sources, inorganic salts, vitamins and/or trace elements. Preferred 
carbon sources are sugars such as mono-, di- or polysaccharides. Examples of very 



WO 2005/014828 PCT/EP2004/008136 

161 

good carbon sources are glucose, fructose, mannose, galactose, ribose, sorbose, 
ribulose, lactose, maltose, sucrose, raffinose, starch or cellulose. Sugars can also be 
added to the media via complex compounds such as molasses, or other byproducts of 
sugar refining. It may also be advantageous to add mixtures of various carbon sources. 
5 Other possible carbon sources are oils and fats such as, for example, soybean oil, 
sunflower oil, peanut oil and/or coconut fat, fatty acids such as, for example, palmitic 
acid, stearic acid and/or linoleic acid, alcohols and/or polyalcohols such as, for 
example, glycerol, methanol and/or ethanol and/or organic acids such as, for example, 
acetic acid and/or lactic acid. Nitrogen sources are usually organic or inorganic 

10 nitrogen compounds or materials, which contain these compounds. Examples of 

nitrogen sources include ammonia in liquid or gaseous form or ammonium salts such 
as ammonium sulfate, ammonium chloride, ammonium phosphate, ammonium 
carbonate or ammonium nitrate, nitrates, urea, amino acids or complex nitrogen 
sources such as corn steep liquor, soybean meal, soybean protein, yeast extract, meat 

15 extract and others. The nitrogen sources may be used singly or as a mixture. Inorganic 
salt compounds, which may be present in the media include the chloride, phosphorus 
or sulfate salts of calcium, magnesium, sodium, cobalt, molybdenum, potassium, 
manganese, zinc, copper and iron. 

[0394.0.0.0] For preparing sulfur-containing fine chemicals, in particular the fine 
20 chemical, it is possible to use as sulfur source inorganic sulfur-containing compounds 
such as, for example, sulfates, sulfites, dithionites, tetrathionates, thiosulfates, sulfides 
or else organic sulfur compounds such as mercaptans and thiols. 

[0395.0.0.0] It is possible to use as phosphorus source phosphoric acid, potassium 
dihydrogenphosphate or dipotassium hydrogenphosphate or the corresponding 

25 sodium-containing salts. Chelating agents can be added to the medium in order to 
keep the metal ions in solution. Particularly suitable chelating agents include 
dihydroxyphenols such as catechol or protocatechuate, or organic acids such as citric 
acid. The fermentation media employed according to the invention for cultivating 
microorganisms normally also contain other growth factors such as vitamins or growth 

30 promoters, which include, for example, biotin, riboflavin, thiamine, folic acid, nicotinic 
acid, pantothenate and pyridoxine. Growth factors and salts are often derived from 
complex media components such as yeast extract, molasses, corn steep liquor and the 
like. Suitable precursors can moreover be added to the culture medium. The exact 
composition of the media compounds depends greatly on the particular experiment and 

35 is chosen individually for each specific case. Information about media optimization is 
obtainable from the textbook "Applied Microbiol. Physiology, A Practical Approach" 
(editors P.M. Rhodes, P.F. Stanbury, IRL Press (1997) pp. 53-73, ISBN 0 19 963577 
3). Growth media can also be purchased from commercial suppliers such as Standard 
1 (Merck) or BHI (Brain heart infusion, DIFCO) and the like. All media components are 

40 sterilized either by heat (1 .5 bar and 121 °C for 20 min) or by sterilizing filtration. The 
components can be sterilized either together or, if necessary, separately. All media 
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components can be present at the start of the cultivation or optionally be added 
continuously or batchwise. The temperature of the culture is normally between 1 5°C 
and 45°C, preferably at 25°C to 40°C, and can be kept constant or changed during the 
experiment. The pH of the medium should be in the range from 5 to 8.5, preferably 
5 around 7. The pH for the cultivation can be controlled during the cultivation by adding 
basic compounds such as sodium hydroxide, potassium hydroxide, ammonia or 
aqueous ammonia or acidic compounds such as phosphoric acid or sulfuric acid. 
Foaming can be controlled by employing antifoams such as, for example fatty acid 
polyglycol esters. The stability of plasmids can be maintained by adding to the medium 

1 0 suitable substances having a selective effect, for example antibiotics. Aerobic 

-conditions are maintained by introducing oxygen or oxygen-containing gas mixtures 
such as, for example ambient air into the culture. The temperature of the culture is 
normally from 20°C to 45°C and preferably from 25°C to 40°C. The culture is continued 
until formation of the desired product is at a maximum. This aim is normally achieved 

15 within 10 hours to 160 hours. 

[0396.0.0.0] The fermentation broths obtained in this way, containing the fine 
chemical in particular for example amino acids such as L-methionine, L-threonine 
and/or L-lysine, normally have a dry matter content of from 7.5 to 25% by weight 
Sugar-limited fermentation is additionally advantageous, at least at the end, but 
especially over at least 30% of the fermentation time. This means that the 
concentration of utilizable sugar in the fermentation medium is kept at, or reduced to, > 
0 to 3 g/l during this time. The fermentation broth is then processed further. Depending 
on requirements, the biomass can be removed entirely or partly by separation 
methods, such as, for example, centrifugation, filtration, decantation or a combination 
of these methods, from the fermentation broth or left completely in it. The fermentation 
broth can then be thickened or concentrated by known methods, such as, for example, 
with the aid of a rotary evaporator, thin-film evaporator, falling film evaporator, by 
reverse osmosis or by nanofiltration. This concentrated fermentation broth can then be 
worked up by freeze-drying, spray drying, spray granulation or by other processes. 

[0397.0.0.0] However, it is also possible to purify the fine chemical produced further. 
For this purpose, the product-containing composition is subjected to a chromatography 
on a suitable resin, in which case the desired product or the impurities are retained 
wholly or partly on the chromatography resin. These chromatography steps can be 
repeated if necessary, using the same or different chromatography resins. The skilled 
worker is familiar with the choice of suitable chromatography resins and their most 
effective use. The purified product can be concentrated by filtration or ultrafiltration and 
stored at a temperature at which the stability of the product is a maximum. 

[0398.0.0.0] The identity and purity of the isolated compound(s) can be determined 
by prior art techniques. These include high performance liquid chromatography 
(HPLC), spectroscopic methods, mass spectrometry (MS), staining methods, thin-layer 
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chromatography, NIRS, enzyme assay or microbiological assays. These analytical 
methods are summarized in: Patek et al. (1994) Appl. Environ. Microbiol. 60:133-140; 
Malakhova et al. (1996) Biotekhnologlya 11 27-32; and Schmidt et al. (1998) 
Bioprocess Engineer. 19:67-70. Ulmann's Encyclopedia of Industrial Chemistry (1996) 
5 Vol. A27, VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, pp. 559-566, 575-581 
and pp. 581=587; Mlchal. Q (1999) Biochemical Pathways: An Atlas of Biochemistry 
and Molecular Biology, John Wiley and Sons; Fallon, A. et al. (1987) Applications of 
HPLC in Biochemistry in: Laboratory Techniques in Biochemistry and Molecular 
Biology, Vol. 17. 

10 [0399.0.0.0] In yet another aspect, the invention also relates to harvestable parts and 
to propagation material of the transgenic plants according to the invention which either 
contain transgenic plant cells expressing a nucleic acid molecule according to the 
invention or which contains cells which show an increased cellular activity of the 
polypeptide of the invention, e.g. an increased expression level or higher activity of the 

15 described protein. 

[0400.0.0.0] Harvestable parts can be in principle any useful parts of a plant, for 
example, flowers, pollen, seedlings, tubers, leaves, stems, fruit, seeds, roots etc. 
Propagation material includes, for example, seeds, fruits, cuttings, seedlings, tubers, 
rootstocks etc. Preferred are seeds, fruits, seedlings or tubers as harvestable or 
20 propagation material. 

[0401 .0.0.0] The invention furthermore relates to the use of the transgenic organisms 
according to the invention and of the cells, cell cultures, parts - such as, for example, 
roots, leaves and the like as mentioned above in the case of transgenic plant 
organisms - derived from them, and to transgenic propagation material such as seeds 
25 or fruits and the like as mentioned above, for the production of foodstuffs or feeding 
stuffs, pharmaceuticals or fine chemicals. 

[0402.0.0.0] Accordingly in another embodiment, the present invention relates to the 
use of the nucleic acid molecule, the organism, e.g. the microorganism, the plant, plant 
cell or plant tissue, the vector, or the polypeptide of the present invention for making 

30 fine chemicals such as fatty acids, carotenoids, isoprenoids, vitamins, lipids, wax 
esters, (polysaccharides and/or polyhydroxyalkanoates, and/or its metabolism 
products, in particular, steroid hormones, cholesterol, prostaglandin, triacylglycerols, 
bile acids and/or ketone bodies producing cells, tissues and/or plants. There are a 
number of mechanisms by which the yield, production, and/or efficiency of production 

35 of fatty acids, carotenoids, isoprenoids, vitamins, wax esters, lipids, (poly)saccharides 
and/or polyhydroxyalkanoates, and/or its metabolism products, in particular, steroid 
hormones, cholesterol, triacylglycerols, prostaglandin, bile acids and/or ketone bodies 
or further of above defined fine chemicals incorporating such an altered protein can be 
affected. In the case of plants, by e.g. increasing the expression of acetyl-CoA which is 
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the basis for many products, e.g., fatty acids, carotenoids, isoprenoids, vitamines, 
lipids, (polysaccharides, wax esters, and/or polyhydroxyalkanoates, and/or its 
metabolism products, in particular, prostaglandin, steroid hormones, cholesterol, 
triacylglycerols, bile acids and/or ketone bodies in a cell, it may be possible to increase 
5 the amount of the produced said compounds thus permitting greater ease of harvesting 
and purification or in case of plants more efficient partitioning. Further, one or more of 
said metabolism products, increased amounts of the cofactors, precursor molecules, 
and intermediate compounds for the appropriate biosynthetic pathways maybe 
required. Therefore, by increasing the number and/or activity of transporter proteins 
10 involved in the Import of nutrients, such as carbon sources (i.e., sugars), nitrogen 
. . sources (i.e., amino acids, ammonium salts), phosphate, and sulfur, it may be possible 
to improve the production of acetyl CoA and its metabolism products as mentioned 
above, due to the removal of any nutrient supply limitations on the biosynthetic 
process. In particular, it may be possible to increase the yield, production, and/or 
15 efficiency of production of said compounds, e.g. fatty acids, carotenoids, isoprenoids, 
vitamins, was esters, lipids, (polysaccharides, and/or polyhydroxyalkanoates, and/or 
its metabolism products, in particular, steroid hormones, cholesterol, prostaglandin, 
triacylglycerols, bile acids and/or ketone bodies molecules etc. in plants. 

[0403.0.0.0] Furthermore preferred is a method for the recombinant production of 
20 pharmaceuticals or fine chemicals in host organisms, wherein a host organism is 

transformed with one of the above-described nucleic acid constructs comprising one or 
more structural genes which encode the desired fine chemical or catalyze the 
biosynthesis of the desired fine chemical, the transformed host organism is cultured, 
and the desired fine chemical is isolated from the culture medium. This method can be 
25 applied widely to fine chemicals such as enzymes, vitamins, amino acids, sugars, fatty 
acids, and natural and synthetic flavorings, aroma substances and colorants or 
compositions comprising these. Especially preferred is the additional production of 
further amino acids, tocopherols and tocotrienols and carotenoids or compositions 
comprising said compounds. The transformed host organisms are cultured and the 
30 products are recovered from the host organisms or the culture medium by methods 
known to the skilled worker or the organism itself servers as food or feed supplement. 
The production of pharmaceuticals such as, for example, antibodies or vaccines, is 
described by Hood EE, Jilka JM. Curr Opin Biotechnol. 1999 Aug; 10(4): 382-6; Ma JK, 
Vine ND. Curr Top Microbiol Immunol. 1999; 236:275-92. 

35 [0404.0.0.0] In one embodiment, the present invention relates to a method for the 

identification of a gene product conferring an increase in the fine chemical production in 
a cell, comprising the following steps: 

(a) contactingr e.g. hybridising, the nucleic acid molecules of a sample, e.g. cells, 
tissues, plants or microorganisms or a nucleic acid library , which can contain a 
40 candidate gene encoding a gene product conferring an increase in the fine 
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chemical after expression, with the nucleic acid molecule of the present 
invention; 

(b) identifying the nucleic acid molecules, which hybridize under relaxed stringent 
conditions with the nucleic acid molecule of the present invention in particular to 

5 the nucleic acid molecule sequence shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 

15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41 43, 45, 55, 57, 59, 61, 63, 
65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 

10 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 

201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 
233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 
265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 
297, 299, 301, 303, 305, 307, 309, 31 1, 313, 315, 317, 319, 321, 323, 325, 327, 

15 329, 331, 333, 335, 337,-339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 

361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 
391 or 393 and, optionally, isolating the full length cDNA clone or complete 
genomic clone; 

(c) introducing the candidate nucleic acid molecules in host cells, preferably in a 
20 plant cell or a microorganism, appropriate for producing the fine chemical; 

(d) expressing the identified nucleic acid molecules in the host cells; 

(e) assaying the the fine chemical level in the host cells; and 

(f) identifying the nucleic acid molecule and its gene product which expression 
confers an increase in the the fine chemical level in the host cell after expression 

25 compared to the wild type. 

[0405.0.0.0] Relaxed hybridisation conditions are: After standard hybridisation 
procedures washing steps can be performed at low to medium stringency conditions 
usually with washing conditions of 40°-55°C and salt conditions between 2xSSC and 
0,2x SSC with 0,1% SDS in comparison to stringent washing conditions as e.g. 60°- 

30 68°C with 0,1 % SDS. Further examples can be found in the references listed above for 
the stringend hybridization conditions. Usually washing steps are repeated with 
increasing stringency and length until a useful signal to noise ratio is detected and 
depend on many factors as the target, e.g. its purity, GC-content, size etc, the probe, 
e.g.its length, is it a RNA or a DNA probe, salt conditions, washing or hybridisation 

35 temperature, washing or hybridisation time etc. 
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[0406.0.0.0] In another embodiment, the present invention relates to a method for the 
identification of a gene product conferring an increase in the fine chemical production in 
a cell, comprising the following steps: 

(a) identifying nucleic acid molecules of an organism; which can contain a candidate 
5 gene encoding a gene product conferring an increase in the fine chemical after 

expression, which are at least 20%, preferably 25%, more preferably 30%, even 
more preferred are 35%. 40% or 50%, even more preferred are 60%, 70% or 
80%, most preferred are 90% or 95% or more homology to the nucleic acid 
molecule of the present invention, for example via homology search in a data 
10 bank; 

(b) introducing the candidate nucleic acid molecules in host cells, preferably in a 
plant cells or microorganisms, appropriate for producing the fine chemical; 

(c) expressing the identified nucleic acid molecules in the host cells; 

(d) assaying the the fine chemcial level in the host cells; and 

15 (e) identifying the nucleic acid molecule and its gene product which expression 

confers an increase in the the fine chemical level in the host cell after expression 
compared to the wild type. \ 

[0407.0.0.0] The nucleic acid molecules identified can then be used for the 
production of the fine chemical in the same way as the nucleic acid molecule of the 
20 present invention. Accordingly, in one embodiment, the present invention relates to a 
process for the production of the fine chemical, comprising (a) identifying a nucleic acid 
molecule according to aforementioned steps (a) to (f) or (a) to (e) and recovering the 
free or bound fine chemical from a organism having an increased cellular activity of a 
polypeptide encoded by the isolated nucleic acid molecule compared to a wild type. 

25 [0408.0.0.0] Furthermore, in one embodiment, the present invention relates to a 

method for the identification of a compound stimulating production of the fine chemical 
to said plant comprising: 

a) contacting cells which express the polypeptide of the present invention or its 
mRNA with a candidate compound under cell cultivation conditions; 

30 b) assaying an increase in expression of said polypeptide or said mRNA; 

c) comparing the expression level to a standard response made in the absence of 
said candidate compound; whereby, an increased expression over the standard 
indicates that the compound is stimulating production of the fine chemical. 



WO 2005/014828 PCT/EP2004/008136 

167 

[0409.0.0.0] Furthermore, in one embodiment, the present invention relates to a 
method for the screening for agonists or an antagonist of the activity of the polypeptide 
of the present invention or used in the process of the present invention, e.g. a 
polypeptide conferring an increase of the fine chemical in an organism or a part therof 
5 after increasing the activity in an organism or a part thereof, comprising: 

(a) contacting cells, tissues , plants or microorganisms which express the 
polypeptide according to the invention with a candidate compound or a sample 
comprising a plurality of compounds under conditions which permit the 
expression the polypeptide of the present invention or used in the process of the 

10 present invention; 

(b) assaying the fine chemical level or the polypeptide expression level in the cell, 
tissue, plant or microorganism or the media the cell, tissue, plant or 
microorganisms is cultured or maintained in; and 

(c) identifying a agonist or antagonist by comparing the measured the fine chemical 
15 level or polypeptide of the invention or used in the invention expression level with 

a standard the fine chemical or polypeptide expression level measured in the 
absence of said candidate compound or a sample comprising said plurality of 
compounds, whereby an increased level over the standard indicates that the 
compound or the sample comprising said plurality of compounds is an agonist 
20 and a decreased level over the standard indicates that the compound or the 

sample comprising said plurality of compounds is an antagonist. 

[0410.0.0.0] Furthermore, in one embodiment, the present invention relates to 
process for the identification of a compound conferring increased the fine chemical 
production in a plant or microorganism, comprising the steps: 

25 (a) culturing a cell or tissue or microorganism or maintaining a plant expressing the 
polypeptide according to the invention or a nucleic acid molecule encoding said 
polypeptide and a readout system capable of interacting with the polypeptide 
under suitable conditions which permit the interaction of the polypeptide with said 
readout system in the presence of a compound or a sample comprising a 

30 plurality of compounds and capable of providing a detectable signal in response 

to the binding of a compound to said polypeptide under conditions which permit 
the expression of said readout system and the polypeptide of the present 
invention or used in the process of the invention; and 

(b) identifying if the compound is an effective agonist by detecting the presence or 
35 absence or increase of a signal produced by said readout system. 



[0411.0.0.0] The screen for a gene product or an agonist conferring an increase in 
the fine chemical production can be performed by growth of an organism for example a 
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microorganism in the presence of growth reducing amounts of an inhibitor of the 
synthesis of the fine chemical. Better growth, eg higher dividing rate or high dry mass 
in comparison to the control under such conditions would identify a gene or gene 
product or an agonist conferring an increase in fine chemical production. 

[0412.0.0.0] One can think to screen for increased fine chemical production by for 
example resistance to a drug blocking the fine chemical synthesis and looking whether 
this effect is dependent on the protein of the invention e.g. comparing near identical 
organisms with low and high biological acitivity of the protein of the invention. 

[0413.0.0.0] Said compound may be chemically synthesized or microbiologically 
produced and/or comprised in, for example, samples, e.g., cell extracts from, e.g., 
plants, animals or microorganisms, e.g. pathogens. Furthermore, said compound(s) 
may be known in the art but hitherto not known to be capable of suppressing or 
activating the polypeptide of the present invention. The reaction mixture may be a cell 
free extract or may comprise a ceil or tissue culture. Suitable set ups for the method of 
the invention are known to the person skilled in the art and are, for example, generally 
described in Alberts et al., Molecular Biology of the Cell, third edition (1994), in 
particular Chapter 17. The compounds may be, e.g., added to the reaction mixture, 
culture medium, injected into the cell or sprayed onto the plant. 

[0414.0.0.0] If a sample containing a compound is identified in the method of the 
invention, then it is either possible to isolate the compound from the original sample 
identified as containing the compound capable of activating or increasing the content of 
the fine chemical in an organism or part thereof, or one can further subdivide the 
original sample, for example, if it consists of a plurality of different compounds, so as to 
reduce the number of different substances per sample and repeat the method with the 
subdivisions of the original sample. Depending on the complexity of the samples, the 
steps described above can be performed several times, preferably until the sample 
• identified according to the method of the invention only comprises a limited number of 
or only one substance(s). Preferably said sample comprises substances of similar 
chemical and/or physical properties, and most preferably said substances are identical. 
Preferably, the compound identified according to the above-described method or its 
derivative is further formulated in a form suitable for the application in plant breeding or 
plant cell and tissue culture. 

[0415.0.0.0] The compounds which can be tested and identified according to a 
method of the invention may be expression libraries, e.g., cDNA expression libraries, 
peptides, proteins, nucleic acids, antibodies, small organic compounds, hormones, 
peptidomimetics, PNAs or the like (Milner, Nature Medicine 1 (1995), 879-880; Hupp, 
Cell 83 (1995), 237-245; Gibbs, Cell 79 (1994), 193-198 and references cited supra). 
Said compounds can also be functional derivatives or analogues of known inhibitors or 
activators. Methods for the preparation of chemical derivatives and analogues are well 
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known to those skilled in the art and are described in, for example, Beilstein, Handbook 
of Organic Chemistry, Springer edition New York Inc., 1 75 Fifth Avenue, New York, 
N.Y. 10010 U.S.A. and Organic Synthesis, Wiley, New York, USA. Furthermore, said 
derivatives and analogues can be tested for their effects according to methods known 
5 in the art. Furthermore, peptidomimetics and/or computer aided design of appropriate 

derivatives unu ai icuuyuco uan uo uoBu, iw> w^n.^.—, . 9 

described above. The cell or tissue that may be employed in the method of the 
invention preferably is a host cell, plant cell or plant tissue of the invention described in 
the embodiments hereinbefore. 

1 0 [041 6.0.0.0] Thus, in a further embodiment the invention relates to a compound 
obtained or identified according to the method for identifiying an agonist of the 
invention said compound being an agonist of the polypeptide of the present invention 
or used in the process of the present invention. 

[041 7.0.0.0] Accordingly, in one embodiment, the present invention further relates to 
15 a compound identified by the method for identifying a compound of the present 
invention. 

[041 8.0.0.0] Said compound is, for example, a homologous of the polypeptide of the 
present invention. Homologues of the polypeptid of the present invention can be 
generated by mutagenesis, e.g., discrete point mutation or truncation of the polypeptide 

20 of the present invenion. As used herein, the term "homologue" refers to a variant form 
of the protein, which acts as an agonist of the activity of the polypeptide of the present 
invention. An agonist of said protein can retain substantially the same, or a subset, of 
the biological activities of the polypeptide of the present invention. In particular, said 
agonist confers the increase of the expression level of the polypeptide of the present 

25 invention and/or the expression of said agonist in an organisms or part thereof confers 
the increase of free and/or bound the fine chemical in the organism or part thereof. 

[0419.0.0.0] In one embodiment, the invention relates to an antibody specifically 
recognizing the compound or agonist of the present invention. 

[0420.0.0.0] The invention also relates to a diagnostic composition comprising at 
30 least one of the aforementioned nucleic acid molecules, vectors, proteins, antibodies or 
compounds of the invention and optionally suitable means for detection. 

[0421 .0.0.0] The diagnostic composition of the present invention is suitable for the 
isolation of mRNA from a cell and contacting the mRNA so obtained with a probe 
comprising a nucleic acid probe as described above under hybridizing conditions, 
35 detecting the presence of mRNA hybridized to the probe, and thereby detecting the 
expression of the protein in the cell. Further methods of detecting the presence of a 
protein according to the present invention comprise immunotechniques well known in 
the art, for example enzyme linked immunosorbent assay. Furthermore, it is possible to 
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use the nucleic acid molecules according to the invention as molecular markers or 
primer in plant breeding. Suitable means for detection are well known to a person 
skilled in the art, e.g. buffers and solutions for hydridization assays, e.g. the 
aforementioned solutions and buffers, further and means for Southern-, Western-, 
5 Northern- etc. -blots, as e.g. described in Sambrook et al. are known. 

[0422.0.0.0] In another embodiment the present invention relates to a kit comprising 
the nucleic acid molecule, the vector, the host cell, the polypeptide, the antisense 
nucleic acid, the antibody, plant cell, the plant or plant tissue, the harvestable part, the 
propagation material and/or the compound or agonist or antagonists identified 
1 0 according to the method of the invention. 

[0423.0.0.0] The compounds of the kit of the present invention may be packaged in 
containers such as vials, optionally with/in buffers and/or solution. If appropriate, one or 
more of said components might be packaged in one and the same container. 
Additionally or alternatively, one or more of said components might be adsorbed to a 

15 solid support as, e.g. a nitrocellulose filter, a glas plate, a chip, or a nylon membrane or 
to the well of a micro titerplate. The kit can be used for any of the herein described 
methods and embodiments, e.g. for the production of the host cells, transgenic plants, 
pharmaceutical compositions, detection of homologous sequences, identification of 
antagonists or agonists, as food or feed or as a supplement thereof, as supplement for 

20 the treating of plants, etc. 

[0424.0.0.0] Further, the kit can comprise instructions for the use of the kit for any of 
said embodiments, in particular for the use for producing organisms or part thereof 
having an increased free or bound the fine chemcial content. 

[0425.0.0.0] In one embodiment said kit comprises further a nucleic acid molecule 
25 encoding one or more of the aforementioned protein, and/or an antibody, a vector, a 
host cell, an antisense nucleic acid, a plant cell or plant tissue or a plant. 

[0426.0.0.0] In a further embodiment, the present invention relates to a method for 
the production of a agricultural composition providing the nucleic acid molecule, the 
vector or the polypeptide of the invention or comprising the steps of the method 
30 according to the invention for the identification of said compound, agonist or antagonist; 
and formulating the nucleic acid molecule, the vector or the polypeptide of the invention 
or the agonist, or compound identified according to the methods or processes of the 
present invention or with use of the subject matters of the present invention in a form 
applicable as plant agricultural composition. 

35 [0427.0.0.0] In another embodiment, the present invention relates to a method for the 
production of a "the fine chemical'-production supporting plant culture composition 
comprising the steps of the method for of the present invention; and formulating the 
compound identified in a form acceptable as agricultural composition. 
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[0428.0.0.0] Under "acceptable as agricultural composition" is understood, that such 
a composition is in agreement with the laws regulating the content of fungicides, plant 
nutrients, herbizides, etc. Preferably such a composition is without any harm for the 
protected plants and the animals (humans included) fed therewith. 

5 [0429.0.0.01 The present invention also pertains to several embodiments relating to 
further uses and methods. The nucleic acid molecule, polypeptide, protein 
homologues, fusion proteins, primers, vectors, host cells, described herein can be used 
in one or more of the following methods: identification of plants useful for fine chemical 
production as mentioned and related organisms; mapping of genomes; identification 

10 and localization of sequences of interest; evolutionary studies; determination of regions 
required for function; modulation of an activity. 

[0430.0.0.0] The nucleic acid molecule of the invention, the vector of the invention or 
the nucleic acid construct of the invention may also be useful for the production of 
organisms resistant to inhibitors of the fine chemical e.g. the amino acid production 

1 5 biosynthesis pathways. In particular, the overexpression of the polypeptide of the 

present invention may protect plants against herbicides, which block the amino acid, in 
particular the fine chemical, synthesis in said plant. Examples of herbicides blocking 
the fine chemical synthesis e.g. the amino acid synthesis in plants are for example 
sulfonylurea and imidazolinone herbicides which catalyze the first step in brariched- 

20 . chain amino acid biosynthesis 

[0431.0.0.0] Accordingly, the nucleic acid molecules of the present invention have a 
variety of uses. First, they may be used to identify an organism or a close relative 
thereof. Also, they may be used to identify the presence thereof or a relative thereof in 
a mixed population of microorganisms or plants. By probing the extracted genomic 
25 DNA of a culture of a unique or mixed population of plants under stringent conditions 
with a probe spanning a region of the gene of the present invention which is unique to 
this, one can ascertain whether the present invention has been used or whether it or a 
close relative is present. 

[0432.0.0.0] Further, the nucleic acid molecule of the invention may be sufficiently 
30 homologous to the sequences of related species such that these nucleic acid 

molecules may serve as markers for the construction of a genomic map in related 
organism. 

[0433.0.0.0] Accordingly, the present invention relates to a method for breeding 
plants for the production of the fine chemical, comprising 

35 (a) providing a first plant variety produced according to the process of the invention 
preferably (over)expressing the nucleic acid molecule of the invention; 



(b) 



crossing the first plant variety with a second plant variety; and 
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(c) selecting the offspring plants which overproduce the fine chemical by means of 
analysis the distribution of a molecular marker in the offspring representing the 
first plant variety and its capability to (overproduce the fine chemical. 

[0434.0.0.0] Details about the use of molecular markers in breeding can be found in 
5 Kumar et al., 1999 (Biotech Adv., 17:143-182) and Peleman and van der Voort 2003 
(Trends Plant Sci. 2003 Jul;8(7):330-334). The molecular marker can e.g. relate to the 
nucleic acid molecule of the invention and/or its expression level. Accordingly, the 
molecular marker can be a probe or a PCR primer set useful for identification of the 
genomic existence or genomic localisation of the nucleic acid molecule of the invention, 
10 e.g. in a Southern blot analysis or a PCR or its expression level, i.g. in a Northern Blot 
analysis or a quantitative PCR. Accordingly, in one embodiment, the present invention 
relates to the use of the nucleic acid molecule of the present invention or encoding the 
polypeptide of the present invention as molecular marker for breeding. 

[0435.0.0.0] The nucleic acid molecules of the invention are also useful for 
15 evolutionary and protein structural studies. By comparing the sequences of the 

invention or used in the process of the invention to those encoding similar enzymes 
from other organisms, the evolutionary relatedness of the organisms can be assessed. 
Similarly, such a comparison permits an assessment of which regions of the sequence 
are conserved and which are not, which may aid in determining those regions of the 
protein which are essential for the functioning of the enzyme. This type of 
determination is of value for protein engineering studies and rrjay give an indication of 
what the protein can tolerate in terms of mutagenesis without losing function. 

[0436.0.0.0] Accordingly, the nucleic acid molecule of the invention can be used for 
the identification of other nucleic acids conferring an increase of the fine chemical after 
expression. v 

[0437.0.0.0] Further, the nucleic acid molecule of the invention or a fragment of a 
gene conferring the expression of the polypeptide of the invention, preferably 
comprising the nucleic acid molecule of the invention, can be used for marker assisted 
breeding or association mapping of the fine chemical derived traits 

[0438.0.0.0] Accordingly, the nucleic acid of the invention, the polypeptide of the 
invention, the nucleic acid construct of the invention, the organisms, the host cell, the 
microorgansims, the plant, plant tissue, plant cell, or the part thereof of the invention, 
the vector of the invention, the agonist identified with the method of the invention, the 
nucleic acid molecule identified with the method of the present invention, can be used 
for the production of the fine chemical or of the fine chemical and one or more other 
ingredients such as amino acids, in particular Threoinine, Alanine, Glutamin, Glutamic 
acid, Valine, Aspargine, Phenylalanine, Leucine, Proline , Tryptophan Tyrosine/Valine, 
Isoleucine and Arginine. 
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Accordingly, the nucleic acid of the invention, or the nucleic acid molecule identified 
with the method of the present invention or the complement sequences thereof, the 
polypeptide of the invention, the nucleic acid construct of the invention, the organisms, 
the host cell, the microorgansims, the plant, plant tissue, plant cell, or the part thereof 
5 of the invention, the vector of the invention, the antagonist identified with the method of 

im/Antinn *h<a onfiK^rlw nf the* nrocAnt in\/<antinn thA anti<%Ans<* mnlpnt iIp of the 
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present invention, can be used for the reduction of the fine chemical in a organism or 
part thereof, e.g. in a cell. 

[0439.0.0.0] Further, the nucleic acid of the invention, the polypeptide of the 
10 invention, the nucleic acid construct of the invention, the organisms, the host cell, the 
microorgansims, the plant, plant tissue, plant cell, or the part thereof of the invention, 
the vector of the invention, the antagonist or the agonist identified with the method of 
the invention, the antibody of the presen invention, the antisense molecule of the 
present invention or the nucleic add molecule identified with the method of the present 
15 invention, can be used for the preparation of an agricultural composition. 

[0440.0.0.0] Furthermore, the nucleic acid of the invention, the polypeptide of the 
invention, the nucleic acid construct of the invention, the organisms, the host cell, the 
microorgansims, the plant, plant tissue, plant cell, or the part thereof of the invention, 
the vector of the invention, antagonist or the agonjst identified with the method of the 

20 invention, the antibody of the presen invention, the antisense molecule of the present 
invention or the nucleic acid molecule identified with the method r K pe present 
invention, can be used for the identification and production of compounds capable of 
conferring a modulation of the fine chemical levels in an organism or parts thereof, 
preferably to identify and produce compounds conferring an increase of the fine 

25 chemical levels in an organism or parts thereof, if said identified compound is applied 
to the organism or part thereof, i.e. as part of its food, or in the growing or culture 
media. 

[0441.0.0.0] These and other embodiments are disclosed and encompassed by the 
description and examples of the present invention. Further literature concerning any 

30 one of the methods, uses and compounds to be employed in accordance with the 
present invention may be retrieved from public libraries, using for example electronic 
devices. For example the public database "Medline" may be utilized which is available 
on the Internet, for example under hftp://www.ncbi.nlm.nih.gov/PubMed/medline.html. 
Further databases and addresses, such as hftp://www.ncbi.nlm.nih.gov/, 

35 hftp://www.infobiogen. fr/, hftp://www.fmi.ch/biology/research-tools.html, 

hftp://www.tigr.org/, are known to the person skilled in the art and can also be obtained 
using, e.g., hftp://www.lycos.com. An overview of patent information in biotechnology 
and a survey of relevant sources of patent information useful for retrospective 
searching and for current awareness are given in Berks, TIBTECH 12 (1994), 352-364. 



40 
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[0442.0.0.0] Table 1 gives an overview about the fine chemicals produced by SEQ ID 
NO: 1 of the present invention. 



Table 1: Fine chemicals produced with SEQ ID NO: 1 (YNL090W) 



Metabolite 


Method 


Min 


Max 


Tryptophane 


LC 


1,26 


i 4,39 


Proline 


LC 


1,73 


l 6,29 


Arginine 


LC 


1,54 


4,23 


Raffinose 


LC 


2,57 


16,05 


Ferulic Acid 


LC 


1,40 


2,41 


Phenylalanine 


LC 


1,34 


2,75 


Tyrosine 


LC 


1,44 


2,55 


gamma+beta-Tocopherol 


LC 


0,36 


0,66 


Cerotic Acid (C26:0) 


GC 


1,41 


3,78 


Lignoceric Acid (C24:0) 


GC 


1,34 


2,14 


Alanine 


GC 


1,21 


1,57 


Glycine 


GC 


1,46 


1,67 


Threonine 


GC 


1,21 


2,02 


Putrescine 


GC 


0,18 


0,42 


Serine 


GC 


1,29 


1,96 


Valine 


GC 


1,20 


2,31 


Isoleucine 


GC 


1,30 


4,35 


Leucine 


GC 


1,51 


5,01 


Sinapic Acid 


GC 


1,28 


2,08 


3,4-Dihydroxyphenylalanine 
(=DOPA) 


GC 


1,37 


1,87 


Stearic Acid (C1 8:0) 


GC 


1,16 


2,13 


beta-Carotene 


LC 


1,77 


2,68 



Column 1 shows the metabolite produced. Column 2 mirrors the analytic method and 
5 column 3 and 4 shows the minimum and maximum production of the respective fine 
chemical as x-fold. 

[0443.0.0.0] The present invention is illustrated by the examples, which follow. The 
present examples illustrate the basic invention without being intended as limiting the 
subject of the invention. The content of all of the references, patent applications, 
10 patents and published patent applications cited in the present patent application is 
herewith incorporated by reference. 
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[0444.0.0.0] Examples 

[0445.0.0.0] Example 1: Cloning SEQ ID NO: 1 in Escherichia coli 

[0446.0.0.0] SEQ ID NO: 1 was cloned into the plasmids pBR322 (Sutcliffe, J.G. 
(1979) Proc. Natl Acad. Sci. USA, 75: 3737-3741); pACYC177 (Change & Cohen 
5 (1978) J. Bacterid. 134: 1141-1156); plasmids of the pBS series (pBSSK+, pBSSK- 
and others; Stratagene, LaJolla, USA) or cosmids such as SuperCosI (Stratagene, 
LaJolla, USA) or Lorist6 (Gibson, T.J. Rosenthal, A., and Waterson, R.H. (1987) Gene 
53: 283-286) for expression in E. coli using known, well-established procedures (see, 
for example, Sambrook, J. et al. (1989) "Molecular Cloning: A Laboratory Manual". 
10 Cold Spring Harbor Laboratory Press or Ausubel, F.M. et al. (1994) "Current Protocols 
in Molecular Biology", John Wiley & Sons). 

[0447.0.0.0] Example 2: DNA sequencing and computerized functional analysis 

[0448.0.0.0] The DNA was sequenced by standard procedures, in particular the 
chain determination method, using ABI377 sequencers (see, for example, Fleischman, 
15 R.D. et al. (1995) 'Whole-genome Random Sequencing and Assembly of Haemophilus 
Influenzae Rti., Science 269; 496-512)". 

[0449.0.0.0] Example 3: In-vivo mutagenesis 

[0450.0.0.0] An in vivo mutagenesis of Corynebacterium glutamicum for the 
production of amino acids can be carried out by passing a plasmid DNA (or another 

20 vector DNA) through E. coli and other microorganisms (for example Bacillus spp. or 
yeasts such as Saccharomyces cerevisiae), which are not capable of maintaining the 
integrity of its genetic information. Usual mutator strains have mutations in the genes 
for the DNA repair system [for example mutHLS, mutD, mutT and the like; for 
comparison, see Rupp, W.D. (1996) DNA repair mechanisms in Escherichia coli and 

25 Salmonella, pp. 2277-2294, ASM: Washington]. The skilled worker knows these 

strains. The use of these strains is illustrated for example in Greener, A. and Callahan, 
M. (1 994) Strategies 7; 32-34. 

[0451 .0.0.0] Example 4: DNA transfer between Escherichia coli and 
Corynebacterium glutamicum 

30 [0452.0.0.0] Several Corynebacterium and Brevibacterium species comprise 
endogenous plasmids (such as, for example, pHM1519 or pBL1), which replicate 
autonomously (for a review, see, for example, Martin, J.F. et al. (1987) Biotechnology 
5: 137-146). Shuttle vectors for Escherichia coli and Corynebacterium glutamicum can 
be constructed easily using standard vectors for E. coli (Sambrook, J. et al., (1989), 

35 "Molecular Cloning: A Laboratory Manual", Cold Spring Harbor Laboratory Press or 
Ausubel, F.M. et al. (1994) "Current Protocols in Molecular Biology", John Wiley & 
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Sons), which have a replication origin for, and suitable marker from, Corynebacterium 
glutamicum added. Such replication origins are preferably taken from endogenous 
plasmids, which have been isolated from Corynebacterium and Brevibacterium 
species. Genes, which are used in particular as transformation markers for these 
5 species are genes for kanamycin resistance (such as those which originate from the 
Tn5 orTn-903 transposon) or for chloramphenicol resistance (Winnacker, E.L (1987) 
"From Genes to Clones - Introduction to Gene Technology, VCH, Weinheim). There 
are many examples in the literature of the preparation of a large multiplicity of shuttle 
vectors which are replicated in E. coli and C. glutamicum and which can be used for 

1 0 various purposes including the overexpression of genes (see, for example, Yoshihama, 
M. et al. (1985) J. -Bacterid.- 162: 591-597, Martin, J.F. et al., (1987) Biotechnology, 5: 
137-146 and Eikmanns, B.J. et al. (1992) Gene 102: 93-98). Suitable vectors which 
replicate in coryneform bacteria are for example, pZ1 (Menkel et al., Appl. Environ. 
Microbiol., 64, 1989: 549 - 554) pEkExl (Eikmanns et al., Gene 102, 1991: 93 - 98) or 

1 5 pHS2-1 (Sonnen et al, Gene 1 07, 1 991 : 69 - 74). These vectors are based on the 
cryptic plasmids pHM1519, pBL1 or pGA1. Other plasmid vectors such as, for 
example, those based on pCG4 (US 4,489,160), pNG2 (Serwold-Davis et al., FEMS 
Microbiol. Lett., 66, 1990: 1 19 - 124) or pAG1 (US 5,158.891) can be used in the 
same manner. 

20 [0453.0.0.0] Using standard methods, it is possible to clone a gene of interest into 
one of the above-described shuttle vectors and to introduce such hybrid vectors into 
Corynebacterium glutamicum strains. The transformation of C. glutamicum can be 
achieved by protoplast transformation (KastsUmata, R. et al., (1984) J. Bacterid. 159, 
306-311), electroporation (Liebl, E. etal., (1989) FEMS Microbiol. Letters, 53: 399-303) 

25 and in those cases where specific vectors are used also by conjugation (such as, for 
example, described in SchaTer, A., etal. (1990) J. Bacterid. 172: 1663-1666). Likewise, 
it is possible to transfer the shuttle vectors for C. glutamicum to E. coli by preparing 
plasmid DNA from C. glutamicum (using standard methods known in the art) and 
transforming it into E. coli. This transformation step can be carried out using standard 

30 methods, but preferably using a Mcr-deficient E. coli strain, such as NM522 (Gough & 
Murray (1983) J. Mol. Biol. 166: 1-19). 

[0454.0.0.0] If the transformed sequence(s) is/are to be integrated advantageously 
into the genome of the coryneform bacteria, standard techniques known to the skilled 
worker also exist for this purpose. Examples, which are used for this purpose are 

35 plasmid vectors as they have been described by Remscheid et al. (Appl. Environ. 
Microbiol., 60, 1994: 126-132) for the duplication and amplification of the hom-thrB 
operon. In this method, the complete gene is cloned into a plasmid vector, which is 
capable of replication in a host such as E. coli, but not in C. glutamicum. Suitable 
vectors are, for example, pSUP301 (Simon et al., BiofTechnology 1, 1983: 784-791), 

40 pKIBmob or pK19mob (SchSfer et al., Gene 145, 1994: 69-73), pGEM-T (Promega 
Corp., Madison, Wl, USA), pCR2.1-TOPO (Schuman, J. Biol. Chem., 269, 1994: 
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32678-32684, US 5,487,993), pCR®Blunt (Invitrogen, Groningen, the Netherlands) or 
pEM1 (Schrumpf etal., J. Bacterid., 173, 1991: 4510-4516). 

[0455.0.0.0] Example 5: Determining the expression of the mutaht/transgenic 
protein 

5 [0456.0.0.0] The observations of the acivity of a mutated, or transgenic, protein in a 
transformed host cell are based on the fact that the protein is expressed in a similar 
manner and in a similar quantity as the wild-type protein. A suitable method for 
determining the transcription quantity of the mutant, or transgenic, gene (a sign for the 
amount of mRNA which is available for the translation of the gene product) is to carry 

10 out a Northern blot (see, for example, Ausubel et al., (1988) Current Protocols in 

Molecular Biology, Wiley: New York), where a primer which is designed in such a way 
that it binds to the gene of interest is provided with a detectable marker (usually a 
radioactive or chemiluminescent marker) so that, when the total RNA of a culture of the 
organism is extracted, separated on a gel, applied to a stable matrix and incubated with 

15 this probe, the binding and quantity of the binding of the probe indicates the presence 
and also the amount of mRNA for this gene. Another method is a quantitative PCR. 
This information detects the extent to which the gene has been transcribed. Total cell 
RNA can be isolated from Corynebacterium glutamicum by a variety of methods, which 
. are known in the art, as described in Bormann, E.R. et al., (1992) Mol. Microbiol. 6: 

20 317-326. 

[0457.0.0.0] Standard techniques, such as Western blot, may be employed to 
determine the presence or relative amount of protein translated from this mRNA (see, 
for example, Ausubel et al. (1988) "Current Protocols in Molecular Biology", Wiley, New 
York). In this method, total cell proteins are extracted, separated by gel 

25 electrophoresis, transferred to a matrix such as nitrocellulose and incubated with a 
probe, such as an antibody, which binds specifically to the desired protein. This probe 
is usually provided directly or indirectly with a chemiluminescent or colorimetric marker, 
which can be detected readily. The presence and the observed amount of marker 
indicate the presence and the amount of the sought mutant protein in the cell. 

30 However, other methods are also known. 

[0458.0.0.0] Example 6: Growth of genetically modified Corynebacterium 
glutamicum: media and culture conditions 

[0460.0.0.0] Genetically modified Corynebacteria are grown in synthetic or. natural 
growth media. A number of different growth media for Corynebacteria are known and 
35 widely available (Lieb et al. (1989) Appl. Microbiol. Biotechnol. 32: 205-210; von der 
Osten et al. (1998) Biotechnology Letters 11:11-16; Patent DE 4 120 867; Liebl (1992) 
"The Genus Corynebacterium", in: The Procaryotes, Vol. II, Balows, A., etal., Ed. 
Springer-Verlag). 
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[0461.0.0.0] Said media which can be used according to the invention usually consist 
of one or more carbon sources, nitrogen sources, inorganic salts, vitamins and trace 
elements. Preferred carbon sources are sugars such as mono-, di- or polysaccharides. 
Examples of very good carbon sources are glucose, fructose, mannose, galactose, 
5 ribose, sorbose, ribulose, lactose, maltose, sucrose, raffinose, starch or cellulose. 

Sugars may also be added to the media via complex compounds such as molasses or 
other by-products of sugar refining. It may also be advantageous to add mixtures of 
various carbon sources. Other possible carbon sources are alcohols and/or organic 
acids such as methanol, ethanol, acetic acid or lactic acid. Nitrogen sources are usually 

10 organic or inorganic nitrogen compounds or materials containing said compounds. 
Examples of nitrogen sources include ammonia gas, aqueous ammonia solutions or 
ammonium salts such as NH 4 CI, or (NH 4 ) 2 S0 4| NH 4 OH, nitrates, urea, amino acids or 
complex nitrogen sources such as cornsteep liquor, soybean flour, soybean protein, 
yeast extract, meat extract and others. Mixtures of the above nitrogen sources may be 

15 used advantageously. 

[0462.0.0.0] Inorganic salt compounds, which may be included in the media 
comprise the chloride, phosphorus or sulfate salts of calcium, magnesium, sodium, 
cobalt, molybdenum, potassium, manganese, zinc, copper and iron. Chelating agents 
may be added to the medium in order to keep the metal ions in solution. Particularly 

20 suitable chelating agents include dihydroxyphenols such as catechol or 

protocatechulate or organic acids such as citric acid. The media usually also contain 
other growth factors such as vitamins or growth promoters, which include, for example, 
biotin, riboflavin, thiamine, folic acid, nicotinic acid, panthothenate and pyridoxine. 
Growth factors and salts are frequently derived from complex media components such 

25 as yeast extract, molasses, cornsteep liquor and the like. The exact composition of the 
compounds used in the media depends heavily on the particular experiment and is 
decided upon individually for each specific case. Information on the optimization of 
media can be found in the textbook "Applied Microbiol. Physiology, A Practical 
Approach- (Ed. P.M. Rhodes, P.F. Stanbury, IRL Press (1997) S. 53-73, ISBN 0 19 

30 963577 3). Growth media can also be obtained from commercial suppliers, for example 
Standard 1 (Merck) or BHI (Brain heart infusion, DIFCO) and the like. 

[0463.0.0.0] All media components are sterilized, either by heat (20 min at 1.5 bar 
und 121°C) or by filter sterilization. The components may be sterilized either together 
or, if required, separately. All media components may be present at the start of the 
35 cultivation or added continuously or batchwise, as desired. 

[0464.0.0.0] The culture conditions are defined separately for each experiment The 
temperature is normally between 15°C and 45°C and may be kept constant or may be 
altered during the experiment. The pH of the medium should be in the range from 5 to 
8.5, preferably around 7.0, and can be maintained by adding buffers to the media. An 
40 example of a buffer for this purpose is a potassium phosphate buffer. Synthetic buffers 
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such as MOPS, HEPES, ACES and the like may be used as an alternative or 
simultaneously. The culture pH value may also be kept constant during the culture 
period by addition of, for example, NaOH or NH 4 OH. If complex media components 
such as yeast extract are used, additional buffers are required less since many 
5 complex compounds have a high buffer capacity. When using a fermenter for the 

♦ho nM \/9lno ran al«n hfi ranulateri nsina aaseous 
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ammonia. 

[0465.0.0.0] The incubation period is generally in a range of from several hours to 
several days. This time period is selected in such a way that the maximum amount of 

10 product accumulates in the fermentation broth. The growth experiments, which are 
disclosed can be carried out in a multiplicity of containers such as microtiter plates, 
glass tubes, glass flasks or glass or metal fermenters of various sizes. To screen a 
large number of clones, the microorganisms should be grown in microtiter plates, glass 
tubes or shake flasks, either using simple flasks or baffle flasks. 100 ml shake flasks 

15 filled with 10% (based on the volume) of the growth medium required are preferably 
used. The flasks should be shaken on an orbital shaker (amplitude 25 mm) at a rate 
ranging from 100 to 300 rpm. Evaporation losses can be reduced by maintaining a 
humid atmosphere; as an alternative, a mathematical correction should be carried out 
for the evaporation losses. 

20 [0466.0.0.0] If genetically modified clones are examined, an unmodified control 
clone, or a control clone, which contains the basic plasmid without insertion, should 
also be included in the tests. If a transgenic sequence is expressed, a control clone 
should advantageously again be included in these tests. The medium is 
advantageously inoculated to an OD600 of 0.5 to 1.5 using cells which have been 

25 grown on agar plates, such as CM plates (10 g/l glucose, 2.5 g/l NaCI, 2 g/l urea, 10 g/l 
polypeptone, 5 g/l yeast extract, 5 g/l meat extract, 22 g/l agar, pH value 6.8 
established with 2M NaOH), which have been incubated at 30°C. The media are 
inoculated either by introducing a saline solution of C. glutamicum cells from CM plates 
or by addition of a liquid preculture of this bacterium. 

30 [0467.0.0.0] Example 7: In-vitro analysis of the function of the proteins encoded by 

the transformed sequences 

[0468.0.0.0] The determination of the activities and kinetic parameters of enzymes is 
well known in the art. Experiments for determining the activity of a specific modified 
enzyme must be adapted to the specific activity of the wild-enzyme type, which is well 
35 within the capabilities of the skilled worker. Overviews of enzymes in general and 

specific details regarding the structure, kinetics, principles, methods, applications and 
examples for the determination of many enzyme activities can be found for example in 
the following literature: Dixon, M., and Webb, E.C: (1979) Enzymes, Longmans, 
London; Fersht (1985) Enzyme Structure and Mechanism, Freeman, New York; Walsh 
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(1979) Enzymatic Reaction Mechanisms. Freeman, San Francisco; Price, N.C., 
Stevens, L. (1982) Fundamentals of Enzymology. Oxford Univ. Press: Oxford; Boyer, 
P.D: Ed. (1983) The Enzymes, 3rd Ed. Academic Press, New York; Bisswanger, H. 
(1994) Enzymkinetik, 2nd Ed. VCH, Weinheim (ISBN 3527300325); Bergmeyer, H.U., 
5 Bergmeyer, J., GraRI, M. Ed. (1983-1986) Methods of Enzymatic Analysis, 3rd Ed. Vol. 
I-XII, Verlag Chemie: Weinheim; and Ullmann's Encyclopedia of Industrial Chemistry 

(1987) Vol. A9, "Enzymes", VCH, Weinheim, pp. 352-363. 

[0469.0.0.0] Example 8: Analysis of the effect of the nucleic acid molecule on the 

production of the amino acids 

10 [0470.0.0.0] The effect of the genetic modification in C. glutamicum on the production 
of an amino acid can be determined by growing the modified microorganisms under 
suitable conditions (such as those described above) and analyzing the medium and/or 
the cellular components for the increased production of the amino acid. Such analytical 
techniques are well known to the skilled worker and encompass spectroscopy, thin- 

15 layer chromatography, various types of staining methods, enzymatic and 

microbiological methods and analytical chromatography such as high-performance 
liquid chromatography (see, for example, Ullman, Encyclopedia of Industrial Chemistry, 
Vol. A2, pp. 89-90 and pp. 443-613, VCH: Weinheim (1985); Fallon, A., et al., (1987) 
"Applications of HPLC in Biochemistry" in: Laboratory Techniques in Biochemistry and 

20 Molecular Biology, Vol. 17; Rehm et al. (1993) Biotechnology, Vol. 3, Chapter III: 
"Product recovery and purification", pp. 469-714, VCH: Weinheim; Belter, P.A. et al. 

(1988) Bioseparations: downstream processing for Biotechnology, John Wiley and 
Sons; Kennedy, J.F. and Cabral, J.M.S. (1992) Recovery processes for biological 
Materials, John Wiley and Sons; Shaeiwitz, J.A. and Henry, J.D. (1988) Biochemical 

25 Separations, in Ullmann's Encyclopedia of Industrial Chemistry, Vol. B3; chapter 11, 
pp. 1-27, VCH: Weinheim; and Dechow, F.J. (1989) Separation and purification 
techniques in biotechnology, Noyes Publications). 

[0471.0.0.0] In addition to the determination of the fermentation end product, other 
components of the metabolic pathways which are used for the production of the 

30 desired compound, such as intermediates and by-products, may also be analyzed in 
order to determine the total productivity of the organism, the yield and/or production 
efficiency of the compound. The analytical methods encompass determining the 
amounts of nutrients in the medium (for example sugars, hydrocarbons, nitrogen 
sources, phosphate and other ions), determining biomass composition and growth, 

35 analyzing the production of ordinary metabolites from biosynthetic pathways and 

measuring gases generated during the fermentation. Standard methods for these are 
described in Applied Microbial Physiology; A Practical Approach, P.M. Rhodes and 
P.F. Stanbury, Ed. IRL Press, pp. 103-129; 131-163 and 165-192 (ISBN: 0199635773) 
and the references cited therein. 
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[0472.0.0.0] Example 9: Purification of the amino acid 

[0473.0.0.0] The amino acid can be recovered from cells or from the supernatant of 
the above-described culture by a variety of methods known in the art. For example, the 
culture supernatant is recovered first. To this end, the cells are harvested from the 
5 culture by slow centrifugation. Cells can generally be disrupted or lysed by standard 
techniques such as mechanical force or sonication. The cell debris is removed by 
centrifugation and the supernatant fraction, if appropriate together with the culture 
supernatant, is used for the further purification of the amino acid. However, it is also 
possible to process the supernatant alone if the amino acid is present in the 
10 supernatant in sufficiently high a concentration. In this case, the amino acid, or the 
amino acid mixture, can be purified further for example via extraction and/or salt 
precipitation or via ion-exchange chromatography. 

[0474.0.0.0] If required and desired, further chromatography steps with a suitable 
resin may follow, the amino acid, but not many contaminants in the sample, being 

1 5 retained on the chromatography resin or the contaminants, but not the sample with the 
product (amino acid), being retained on the resin. If necessary, these chromatography 
steps may be repeated, using identical or other chromatography resins. The skilled 
worker is familiar with the selection of suitable chromatography resin and the most 
effective use for a particular molecule to be purified. The purified product can be 

20 concentrated by filtration or ultrafiltration and stored at a temperature at which 

maximum product stability is ensured. Many purification methods, which are not limited 
to the above purification method are known in the art. They are described, for example, 
in Bailey, J.E. & Ollis, D.F. Biochemical Engineering Fundamentals, McGraw-Hill: New 
York (1986). 

25 [0475.0.0.0] Identity and purity of the amino acid isolated can be determined by 
standard techniques of the art. They encompass high-performance liquid 
chromatography (HPLC), spectroscopic methods, mass spectrometry (MS), staining 
methods, thin-layer chromatography, NIRS, enzyme assay or microbiological assays. 
These analytical methods are compiled in: Patek et al. (1994) Appl. Environ. Microbiol. 

30 60: 133-140; Malakhova et al. (1996) Biotekhnologiya 11: 27-32; and Schmidt et al. 

(1998) Bioprocess Engineer. 19: 67-70. Ulmann's Encyclopedia of Industrial Chemistry 
(1996) Vol. A27, VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, pp. 559-566, 
575-581 and pp. 581-587; Michal, G (1999) Biochemical Pathways: An Atlas of 
Biochemistry and Molecular Biology, John Wiley and Sons; Fallon, A. et al. (1987) 

35 Applications of HPLC in Biochemistry in: Laboratory Techniques in Biochemistry and 
Molecular Biology, Vol. 17. 
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[0476.0.0.0] Example 10: Cloning SEQ ID NO: 1 for the expression in plants 

[0477.0.0.0] Unless otherwise specified, standard methods as described in 
Sambrook et al., Molecular Cloning: A laboratory manual, Cold Spring Harbor 1989, 
Cold Spring Harbor Laboratory Press are used. 

5 [0478.0.0.0] SEQ ID NO: 1 is amplified by PCR as described in the protocol of the 
Pfu Turbo or DNA Herculase polymerase (Stratagene). 

[0479.0.0.0] The composition for the protocol of the Pfu Turbo DNA polymerase was 
as follows: 1x PCR buffer (Stratagene), 0.2 mM of each dNTP, 100 ng genomic DNA of 
Saccharomyces cerevisiae (strain S288C; Research Genetics, Inc., now Invitrogen) or 
10 Escherichia coli (strain MG1655; E.coii Genetic Stock Center), 50 pmol forward primer, 
50 pmol reverse primer, 2.5 u Pfu Turbo DNA polymerase. The amplification cycles 
were as follows: 

1 cycle of 3 minutes at 94-95°C, followed by 25-36 cycles of in each case 1 minute at 
95°C or 30 seconds at 94°C, 45 seconds at 50°C, 30 seconds at 50°C or 30 seconds 
15 at 55°C and 210-480 seconds at 72°C, followed by 1 cycle of 8 minutes at 72°C, then 
4°C. 

[0480.0.0.0] The composition for the protocol of the Herculase polymerase was as 
follows: 1x PCR buffer (Stratagene), 0.2 mM of each dNTP, 100 ng genomic DNA of 
Saccharomyces cerevisiae (strain S288C; Research Genetics, Inc., now Invitrogen) or 
20 Escherichia coli (strain MG1 655; E.coii Genetic Stock Center), 50 pmol forward primer, 
50 pmol reverse primer, 2.5 u Herculase polymerase. The amplification cycles were as 
follows: 

[0481.0.0.0] 1 cycle of 2-3 minutes at 94°C, followed by 25-30 cycles of in each case 
30 seconds at 94°C, 30 seconds at 55-60°C and 5-10 minutes at 72°C, followed by 1 
25 cycle of 10 minutes at 72°C, then 4°C. 

[0482.0.0.0] The following primer sequences were selected for the gene 
SEQ ID NO: 1: 

i) forward primer (SEQ I D NO: 53) 
S'-ATGTGTGAAAAGGCCGTTAGAAGG-S' 

30 ii) reverse primer (SEQ ID NO: 54) 

5'-TTATAAAATTATGCAACAGTTAGCCC-3' 

[0483.0.0.0] Thereafter, the amplificate was purified over QIAquick columns following 
the standard protocol (Qiagen). 
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[0484.0.0.0] For the cloning of PCR-products, produced by Pfu Turbo DNA 
polymerase, the vector DNA (30 ng) was restricted with Sma\ following the standard 
protocol (MBI Fermentas)and stopped by addition of high-salt buffer The restricted 
vector fragments were purified via Nucleobond columns using the standard protocol 
5 (Macherey-Nagel). Thereafter, the linearized vector was dephosphorylated following 
the standard protocol (MBI Fermentas). 

[0485.0.0.0] The PCR-products, produced by Pfu Turbo DNA polymerase, were 
directly cloned into the processed binary vector. 

[0486.0.0.0] The DNA termini of the PCR-products, produced by Herculase DNA 
10 polymerase, were blunted in a second synthesis reaction using Pfu Turbo DNA 

polymerase. The composition for the protocol of the blunting the DNA-termini was as 
follows: 0.2 mM blunting dTTP and 1 .25 u Pfu Turbo DNA polymerase. The reaction 
was incubated at 72°C for 30 minutes. Then the PCR-products were cloned into the 
processed vector as well. 

15 [0487.0.0.0] A binary vector comprising a selection cassette (promoter, selection 
marker, terminator) and an expression cassette with promoter, cloning cassette and 
terminator sequence between the T-DNA border sequences was used. In addition to 
those within the cloning cassette, the binary vector has no Sma\ cleavage site. Binary 
vectors which can be used are known to the skilled worker; an overview of binary 

20 vectors and their use can be found in Hellens, R., Mullineaux, P. and Klee H., [(2000) 
"A guide to Agrobacterium binary vectors", Trends in Plant Science, Vol. 5 No. 10, 446- 
451. Depending on the vector used, cloning may advantageously also be carried out 
via other restriction enzymes. Suitable advantageous cleavage sites can be added to 
the ORF by using suitable primers for the PCR amplification. 

25 [0488.0.0.0] Approximately 30 ng of prepared vector and a defined amount of 
prepared amplificate were mixed and ligated by addition of ligase. 

[0489.0.0.0] The ligated vectors were transformed in the same reaction vessel by 
addition of competent £. coli cells (strain DH5alpha) and incubation for 20 minutes at 
1°C followed by a heat shock for 90 seconds at 42°C and cooling to 4°C. Then, 
30 complete medium (SOC) was added and the mixture was incubated for 45 minutes at 
37°C. The entire mixture was subsequently plated onto an agar plate with antibiotics 
(selected as a function of the binary vector used) and incubated overnight at 37°C. 

[0490.0.0.0] The outcome of the cloning step was verified by amplification with the 
aid of primers which bind upstream . and downstream of the integration site, thus 
35 allowing the amplification of the insertion. In addition combinations of the above 

mentioned gene specific primers and upstream and downstream primers were used in 
PCR reactions to identify clones with the correct insert orientation. The amplifications 
were carried as described in the protocol of Tag DNA polymerase (Gibco-BRL). 
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[0491.0.0.0] The amplification cycles were as follows: 1 cycle of 5 minutes at 94°C, 
followed by 35 cycles of in each case 15 seconds at 94°C, 15 seconds at 50-66°C and 
5 minutes at 72°C, followed by 1 cycle of 10 minutes at 72°C, then 4°C. 

[0492.0.0.0] Several colonies were checked, but only one colony for which a PCR 
5 product of the expected size was detected was used in the following steps. 

[0493.0.0.0] A portion of this positive colony was transferred into a reaction vessel 
filled with complete medium (LB) and incubated overnight at 37°C. The LB medium 
contained an antibiotic chosen to suit the binary vector (see above) used and the 
resistance gene present therein in order to select the clone. 

1 0 [0494.0.0.0] The plasmid preparation was carried out as specified in the Qiaprep 
standard protocol (Qiagen). 

[0495.0.0.0] Example 1 1 : Generation of transgenic plants which express 

SEQ ID NO: 1 

[0496.0.0.0] 1 ng of the plasmid DNA isolated was transformed by electroporation 
1 5 into competent cells of Agrobacterium tumefaciens, of strain GV 31 01 pMP90 (Koncz 
and Schell, Mol. Gen. Gent. 204, 383-396, 1986). The choice of the agrobacterial strain 
depends on the choice of the binary vector. An overview of possible strains and their 
. properties is found in Hellens, R., Mullineaux, P. and Klee H., (2000) ." A guide to 
Agrobacterium binary vectors, Trends in Plant Science, Vol. 5 No. 10, 446-451. 
20 Thereafter, complete medium (YEP) was added and the mixture was transferred into a 
fresh reaction vessel for 3 hours at 28°C. Thereafter, all of the reaction mixture was 
plated onto YEP agar plates supplemented with the respective antibiotics, for example 
rifampicin and gentamycin for GV3101 pMP90, and a further antibiotic for the selection 
onto the binary vector, was plated, and incubated for 48 hours at 28°C. 

25 [0497.0.0.0] The agrobacteria generated in Example 10, which contains the plasmid 
construct were then used for the transformation of plants. 

[0498.0.0.0] A colony was picked from the agar plate with the aid of a pipette tip and 
taken up in 3 ml of liquid TB medium, which also contained suitable antibiotics, 
depending on the agrobacterial strain and the binary plasmid. The preculture was 
30 grown for 48 hours at 28°C and 120 rpm. 

[0499.0.0.0] 400 ml of LB medium containing the same antibiotics as above were 
used for the main culture. The preculture was transferred into the main culture. It was 
grown for 1 8 hours at 28°C and 1 20 rpm. After centrifugation at 4 000 rpm, the pellet 
was resuspended in infiltration medium (MS medium, 10% sucrose). 

35 [0500.0.0.0] In order to grow the plants for the transformation, dishes (Piki Saat 80, 
green, provided with a screen bottom, 30 x 20 x 4.5 cm, from Wiesauplast, 
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Kunststofftechnik, Germany) were half-filled with a GS 90 substrate (standard soil, 
Werkverband E.V., Germany). The dishes were watered overnight with 0.05% Proplant 
solution (Chimac-Apriphar, Belgium). Arabidopsis thaliana C24 seeds (Nottingham 
Arabidopsis Stock Centre, UK ; NASC Stock N906) were scattered over the dish, 
5 approximately 1 000 seeds per dish. The dishes were covered with a hood and placed 
in the stratification facility (S h, 1 10 \i urncl/nrVs 1 , 22°C; 16 h, dark, 6°C), After 5 days, 
the dishes were placed into the short-day controlled environment chamber (8 h 130 
umol/m 2 /s-\ 22°C; 16 h, dark 20°C), where they remained for approximately 10 days 
until the first true leaves had formed. 

10 [0501.0.0.0] The seedlings were transferred . into pots containing the same substrate 
(Teku pots, 7 cm, LC series, manufactured by Poppelmann GmbH & Co, Germany). 
Five plants were pricked out into each pot. The pots were then returned into the short- 
day controlled environment chamber for the plant to continue growing. 

[0502.0.0.0] After 10 days, the plants were transferred into the greenhouse cabinet 
15 (supplementary illumination, 16 h, 340 jiE, 22°C; 8 h, dark, 20°C), where they were 
allowed to grow for further 17 days. 

[0503.0.0.0] For the transformation, 6-week-old Arabidopsis plants which had just 
started flowering were immersed for 10 seconds into the above-described agrobacterial 
suspension which had previously been treated with 10 *il Silwett L77 (Crompton S.A, 
20 Osi Specialties, Switzerland). The method in question is described in Clough and Bent, 
1998 (Clough, JC and Bent, AF. 1998 Floral dip: a simplified method for 
/Agro/jacferium-mediated transformation of Arabidopsis thaliana, Plant J. 16:735-743. 

[0504.0.0.0] The plants were subsequently placed for 18 hours into a humid 
chamber. Thereafter, the pots were returned to the greenhouse for the plants to 
25 continue growing. The plants remained in the greenhouse for another 1 0 weeks until 
the seeds were ready for harvesting. 

[0505.0.0.0] Depending on the resistance marker used for the selection of the 
transformed plantsthe harvested seeds were planted in the greenhouse and subjected 
to a spray selection or else first sterilized and then grown on agar plates supplemented 
30 with the respective selection agent. . In case of BASTAO-resistance, plantlets were 
sprayed four times at an interval of 2 to 3 days with 0.02 % BASTA® and transformed 
plants were allowed to set seeds. The seeds of the transgenic A. thaliana plants were 
stored in the freezer (at -20°C). 

[0506.0.0.0] Example 12: Plant culture for bioanalytical analyses 



35 



[0507.0.0.0] For the bioanalytical analyses of the transgenic plants, the latter were 
grown uniformly a specific culture facility. To this end the GS-90 substrate as the 
compost mixture was introduced into the potting machine (Laible System GmbH, 
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Singen, Germany) and filled into the pots. Thereafter, 35 pots were combined in one 
dish and treated with Previcur. For the treatment, 25 ml of Previcur were taken up in 10 
I of tap water. This amount was sufficient for the treatment of approximately 200 pots. 
The pots were placed into the Previcur solution and additionally irrigated overhead with 
5 tap water without Previcur. They were used within four days. . 

[0508.0.0.0] For the sowing, the seeds, which had been stored in the refrigerator (at - 
20°C) ( were removed from the Eppendorf tubes with the aid of a toothpick and 
transferred into the pots with the compost In total, approximately 5 to 12 seeds were 
distributed in the middle of the pot. 

10 [0509.0.0.0] After the seeds had been sown, the dishes with the pots were covered 
with matching plastic hood and placed into the stratification chamber for 4 days in the 
dark at 4°C. The humidity was approximately 90%. After the stratification, the test 
plants were grown for 22 to 23 days at a 16-h-light, 8-h-dark rhythm at 20°C, an 
atmospheric humidity of 60% and a C0 2 concentration of approximately 400 ppm. The 
15 light sources used were Powerstar HQI-T 250 W/D Daylight lamps from Osram, which 
generate a light resembling the solar color spectrum with a light intensity of 
approximately 220 |jJE/m2/s-1 

[0510.0.0.0] When the plants were 8, 9 and 10 days old, they were subjected to 
selection for the resistance marker Approximately 1400 pots with transgenic plants 
were treated with 11 0,015% vol/vol of Basta® (Glufosinate-ammonium) solution in 
water (Aventis Cropsience, GermanyAfter a further 3 to 4 days, the transgenic, 
resistant seedlings (plantlets in the 4-leaf stage) could be distinguished clearly from the 
untransformed plantlets. The nontransgenic seedlings were bleached or dead. The 
transgenic resistance plants were thinned when they had reached the age of 14 days. 
The plants, which had grown best in the center of the pot were considered the target 
plants. All the remaining plants were removed carefully with the aid of metal tweezers 
and discarded. 

[051 1.0.0.0] During their growth, the plants received overhead irrigation with distilled 
water (onto the compost) and bottom irrigation into the placement grooves. Once the 
grown plants had reached the age of 23 days, they were harvested. 

[0512.0.0.0] Example 13: Metabolic analysis of transformed plants 

[0513.0.0.0] The modifications identified in accordance with the invention, in the 
content of above-described metabolites, were identified by the following procedure. 

a) sampling and storage of the samples 



[0514.0.0.0] Sampling was performed directly in the controlled-environment 
chamber. The plants were cut using small laboratory scissors, rapidly weighed on 
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laboratory scales, transferred into a pre-cooled extraction sleeve and placed into an 
aluminum rack cooled by liquid nitrogen. If required, the extraction sleeves can be 
stored in the freezer at -80°C. The time elapsing between cutting the plant to freezing it 
in liquid nitrogen amounted to not more than 10 to 20 seconds. 

5 b) Lyophilization 

[0515.0.0.0] During the experiment, care was taken that the plants either remained in 
the deep-frozen state (temperatures < -40°C) or were freed from water by 
lyophilization until the first contact with solvents. 

[0516.0.0.0] The aluminum rack with the plant samples in the extraction sleeves was 
10 placed into the pre-cooled (~40°C) lyophilization facility. The initial temperature during 
the main drying phase was-35°C and the pressure was 0.120 mbar. During the drying 
phase, the parameters were altered following a pressure and temperature program. 
The final temperature after 12 hours was +30°C and the final pressure was 
0.001 to 0.004 mbar. After the vacuum pump and the refrigerating machine had been 
1 5 switched off, the system was flushed with air (dried via a drying tube) or argon. 

c) Extraction 

[0517.0.0.0] Immediately after the lyophilization apparatus had been flushed, the 
extraction sleeves with the lyophilized plant material were transferred into the 5 ml 
extraction cartridges of the ASE device (Accelerated Solvent Extractor ASE 200 with 
20 Solvent Controller and AutoASE software (DIONEX)). 

[0518.0.0.0] The 24 sample positions of an ASE device (Accelerated Solvent 
Extractor ASE 200 with Solvent Controller and AutoASE software (DIONEX)) were 
filled with plant samples, including some samples for testing quality control. 

[0519.0.0.0] The polar substances were extracted with approximately 10 ml of 
25 methanol/water (80/20, v/v) at T = 70°C and p = 140 bar, 5 minutes heating-up phase, 
1 minute static extraction. The more lipophilic substances were extracted with 
approximately 10 ml of methanol/dichloromethane (40/60, v/v) at T = 70°C and 
p = 140 bar, 5 minute heating-up phase, 1 minute static extraction. The two solvent 
mixtures were extracted into the same glass tubes (centrifuge tubes, 50 ml, equipped 
30 with screw cap and pierceable septum for the ASE (DIONEX)). 

[0520.0.0.0] The solution was treated with internal standards: ribitol, L-glycine-2,2-d 2> 
L-alanine-2,3,3,3-d 4 , methionine-methyl-d 3 , and a-methylglucopyranoside and methyl 
nonadecanoate, methyl undecanoate, methyl tridecanoate, methyl pentadecanoate, 
methyl nonacosanoate. 

35 [0521 .0.0.6] The total extract was treated with 8 ml of water. The solid residue of the 
plant sample and the extraction sleeve were discarded. 



WO 2005/014828 PCT/EP2004/008136 

188 

[0522.0.0.0] The extract was shaken and then centrifuged for 5 to 1 0 minutes at at 
least 1 400 g in order to accelerate phase separation. 1 ml of the supernatant 
methanol/water phase ("polar phase", colorless) was removed for the further GC 
analysis, and 1 ml was removed for the LC analysis. The remainder of the 
5 methanol/water phase was discarded. 0.5 ml of the organic phase ("lipid phase", dark 
green) was removed for the further GC analysis and 0.5 ml was removed for the LC 
analysis. All the portions removed were evaporated to dryness using the IR Dancer 
infrared vacuum evaporator (Hettich). The maximum temperature during the 
evaporation process did not exceed 40°C. Pressure in the apparatus was hot less than 
10 10 mbar. 

d) Processing the lipid phase for the LC/MS or LC/MS/MS analysis 

[0523.0.0.0] The lipid extract, which had been evaporated to dryness was taken up in 
mobile phase. The HPLC was run with gradient elution. 

15 [0524,0.0.0] The polar extract, which had been evaporated to dryness was taken up 
in mobile phase. The HPLC was run with gradient elution. 

e) Derivatization of the lipid phase for the GC/MS analysis 

[0525.0.0.0] For the transmethanolysis, a mixture of 140 pi of chloroform, 37 pi of 
20 hydrochloric acid (37% by weight HCI in water), 320 pi of methanol and 20 pi of toluene 
was added to the evaporated extract. The vessel was sealed tightly and heated for 2 
hours at 100°C, with shaking. The solution was subsequently evaporated to dryness. 
The residue was dried completely. 

[0526.0.0.0] The methoximation of the carbonyl groups was carried out by reaction 
25 with methoxyamine hydrochloride (5 mg/ml in pyridine, 100 fjj for 1.5 hours at 60°C) iri 
a tightly sealed vessel. 20 pi of a solution of odd-numbered, straight-chain fatty acids 
(solution of each 0.3 mg/mL of fatty acids from 7 to 25 carbon atoms and each 0.6 
mg/mL of fatty acids with 27, 29 and 31 carbon atoms in 3/7 (v/v) pyridine/toluene) 
were added as time standards. Finally, the derivatization with 100 pi of N-methyl-N- 
30 (trimethylsilyl)-2,2, 2-trifluoroacetamide (MSTFA) was carried out for 30 minutes at 
60°C, again in the tightly sealed vessel. The final volume before injection into the GC 
was 220 pi. 

f) Derivatization of the polar phase for the GC/MS analysis 

35 [0527.0.0.0] The methoximation of the carbonyl groups was carried out by reaction 
with methoxyamine hydrochloride (5 mg/ml in pyridine, 50 pJ for 1.5 hours at 60°C) in a 
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tightly sealed vessel. 10 Ml of a solution of odd-numbered, straight-chain fatty acids 
(solution of each 0.3 mg/mL of fatty acids from 7 to 25 carbon atoms and each 0.6 
mg/mL of fatty acids with 27, 29 and 31 carbon atoms in 3/7 (v/v) pyridine/toluene) 
were added as time standards. Finally, the derivatization with 50 pi of N-methyl-N- 
5 (trimethylsilyl)-2,2,2-trif!uoroacetamide (MSTFA) was carried out for 30 minutes at 
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was 110 pi. 

g) Analysis of the various plant samples 

[0528.0.0.0] The samples were measured in individual series of 20 plant samples 
10 each (also referred to as sequences), each sequence containing at least 5 wild-type 
plants as controls. The peak area of each analyte was divided by the peak area of the 
respective internal standard. The data were standardized for the fresh weight 
established for the plant. The values calculated thus were related to the wild-type 
control group by being divided by the mean of the corresponding data of the wild-type 
1 5 control group of the same sequence. The values obtained were referred to as 

ratio_by_WT, they are comparable between sequences and indicate how much the 
analyte concentration in the mutant differs in relation to the wild-type control. 
Appropiate controls were done before to proof that the vector and transformation 
procedure itself has no significant influence on the metabolic composition of the plants. 
20 Therefore the desribed changes in comparison with wildtypes were caused by the 
introduced genes. 

[0529.0.0.0] As an alternative, the amino acids can be detected advantageously via 
HPLC separation in ethanolic extract as described by Geigenberger et al. (Plant Cell & 
Environ, 19, 1996:43-55). 

25 The results of the different plant analyses can be seen from the table 1 . 

[0530.0.0.0] Column 1 in Table 1 shows the amino acid analyzed. Columns 3 and 4 
shows the ratio of the analyzed amino acid between the transgenic plants and the wild 
type; Increase of the metabolites: Max: maximal x-fold (normalised to wild type)-Min: 
minimal x-fold (normalised to wild type). Decrease of the metabolites: Max: maximal x- 
30 fold (normalised to wild type) (minimal decrease), Min: minimal x-fold (normalised to 
wild type) (maximal decrease). Column 2 indicates the analytical method. 

[0531.0.0.0] When the analyses were repeated independently, all results proved to 
be significant. 
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[0532.0.0.0] Example 14a: Engineering ryegrass plants by over-expressing 

YNL090W, e.g. from Saccharomyces cerevisiae or plants 

[0533.0.0.0] Seeds of several different ryegrass varieties can be used as explant 
sources for transformation, including the commercial variety Gunne available from 
5 Svalof Weibull seed company or the variety Affinity. Seeds are surface-sterilized 
sequentially with 1% Tween-20 for 1 minute, 100 % bleach for 60 minutes, 3 rinses 
with 5 minutes each with de-ionized and distilled H20, and then germinated for 3-4 
days on moist, sterile filter paper in the dark. Seedlings are further sterilized for 1 
minute with 1% Tween-20, 5 minutes with 75% bleach, and rinsed 3 times with ddH 2 0, 
10 5 min each, . 

[0534.0.0.0] Surface-sterilized seeds are placed on the callus induction medium 
containing Murashige and Skoog basal salts and vitamins, 20 g/l sucrose, 150 mg/l 
asparagine, 500 mg/l casein hydrolysate, 3 g/l Phytagel, 10 mg/l BAP, and 5 mg/l 
dicamba. Plates are incubated in the dark at 25°C for 4 weeks for seed germination 
15 and embryogenic callus induction. 

[0535.0.0.0] After 4 weeks on the callus induction medium, the shoots and roots of 
the seedlings are trimmed away, the callus is transferred to fresh media, is maintained 
in culture for another 4 weeks, and is then transferred to MSO medium in light for 2 
weeks. Several pieces of callus (11-17 weeks old) are either strained through a 10 

20 mesh sieve and put onto callus induction medium, or are cultured in 100 ml of liquid 
ryegrass callus induction media (same medium as for callus induction with agar) in a 
250 ml flask. The flask is wrapped in foil and shaken at 175 rpm in the dark at 23°C for 
1 week. Sieving the liquid culture with a 40-mesh sieve is collected the cells. The 
fraction collected on the sieve is plated and is cultured on solid ryegrass callus 

25 induction medium for 1 week in the dark at 25°C. The callus is then transferred to and 
is cultured oh MS medium containing 1% sucrose for 2 weeks. 

[0536.0.0.0] Transformation can be accomplished with either Agrobacterium or with 
particle bombardment methods. An expression vector is created containing a 
constitutive plant promoter and the cDNA of the gene in a pUC vector. The plasmid 

30 DNA is prepared from E. coli cells using with Qiagen kit according to manufacturer's 
instruction. Approximately 2 g of embryogenic callus is spread in the center of a sterile 
filter paper in a Petri dish. An aliquot of liquid MSO with 10 g/l sucrose is added to the 
filter paper. Gold particles (1.0 pm in size) are coated with plasmid DNA according to 
method of Sanford et al., 1993 and are delivered to the embryogenic callus with the 

35 following parameters: 500 pg particles and 2 pg DNA per shot, 1300 psi and a target 
distance of 8.5 cm from stopping plate to plate of callus and 1 shot per plate of callus. 

[0537.0.0.0] After the bombardment, calli are transferred back to the fresh callus 
development medium and maintained in the dark at room temperature for a 1-week 
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period. The callus is then transferred to growth conditions in the light at 25 °C to initiate 
embryo differentiation with the appropriate selection agent, e.g. 250 nM Arsenal, 5 mg/l 
PPT or 50 mg/L Kanamycin. Shoots resistant to the selection agent are appearing and 
once rooted are transferred to soil. 

5 [0538.0.0.0] Samples of the primary transgenic plants (TO) are analyzed by PCR to 
confirm the presence of T-DNA. These results are confirmed by Southern hybridization 
in which DNA is electrophoresed on a 1% agarose gel and transferred to a positively 
charged nylon membrane (Roche Diagnostics). The PCR DIG Probe Synthesis Kit 
(Roche Diagnostics) is used to prepare a digoxigenin-labelled probe by PCR, and used 

10 as recommended by the manufacturer. . 

[0539.0.0.0] Transgenic TO ryegrass plants are propagated vegetatively by excising 
tillers. The transplanted tillers are maintained in the greenhouse for 2 months until well 
established. The shoots are defoliated and allowed to grow for 2 weeks. 

[0540.0.0.0] Example 14b: Engineering soybean plants by over-expressing 
1 5 YNL090W, e.g. from Saccharomyces cerevisiae or plants 

[0541 .0.0.0] Soybean can be transformed according to the following modification of 
the method described in the Texas A&M patent US 5,164,310. Several commercial 
soybean varieties are amenable to transformation by this method. The cultivar Jack 
(available from the Illinois Seed Foundation) is commonly used for transformation. 

20 Seeds are sterilized by immersion in 70% (v/v) ethanol for 6 min and in 25 % 

commercial bleach (NaOCI) supplemented with 0.1% (v/v) Tween for 20 min, followed 
by rinsing 4 times with sterile double distilled water. Removing the radicle, hypocotyl 
and one cotyledon from each seedling propagates seven-day seedlings. Then, the 
epicotyl with one cotyledon is transferred to fresh germination media in petri dishes and 

25 incubated at 25 °C under a 16-hr photoperiod (approx. 100 uE-m-2s-1) for three 
weeks. Axillary nodes (approx. 4 mm in length) are cut from 3-4 week-old plants. 
Axillary nodes are excised and incubated in Agrobacterium LBA4404 culture. 

[0542.0.0.0] Many different binary vector systems have been described for plant 
transformation (e.g. An, G. in Agrobacterium Protocols. Methods in Molecular Biology 

30 vol 44, pp 47-62, Gartland KMA and MR Davey eds. Humana Press, Totowa, New 
Jersey). Many are based on the vector pBIN19 described by Bevan (Nucleic Acid 
Research. 1984. 12:8711-8721) that includes a plant gene expression cassette flanked 
by the left and right border sequences from the Ti plasmid of Agrobacterium 
tumefaciens. A plant gene expression cassette consists of at least two genes - a 

35 selection marker gene and a plant promoter regulating the transcription of the cDNA or 
genomic DNA of the trait gene. Various selection marker genes can be used as 
described above, including the Arabidopsis gene encoding a mutated acetohydroxy 
acid synthase (AHAS) enzyme (US patents 57673666 and 6225105). Similarly, various 
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promoters can be used to regulate the trait gene to provide constitutive, developmental, 
tissue or environmental regulation of gene transcription as described above. In this 
example, the 34S promoter (GenBank Accession numbers M59930 and X16673) is 
used to provide constitutive expression of the trait gene. 

[0543.0.0.0] After the co-cultivation treatment, the explants are washed and 
transferred to selection media supplemented with 500 mg/L timentin. Shoots are 
excised and placed on a shoot elongation medium. Shoots longer than 1 cm are placed 
on rooting medium for two to four weeks prior to transplanting to soil. 

[0544.0.0.0] The primary transgenic plants (TO) are analyzed by PCR to confirm the 
presence of T-DNA. These results are confirmed by Southern hybridization in which 
DNA is electrophoresed on a 1 % agarose gel and transferred to a positively charged 
nylon membrane (Roche Diagnostics). The PCR DIG Probe Synthesis Kit (Roche 
Diagnostics) is used to prepare a digoxigenin-labelled probe by PCR, and is used as 
recommended by the manufacturer. 

[0545.0.0.0] Example 14c: Engineering corn plants by over-expressing 

YNL090W, e.g. from Saccharomyces cerevisiae or plants 

[0546.0.0.0] Transformation of maize (Zea Mays L.) is performed with a modification 
of the method described by Ishida et al. (1996. Nature Biotech 14745-50). 
Transformation is genotype-dependent in corn and only specific genotypes are 
amenable to transformation and regeneration. The inbred line A188 (University of 
Minnesota) or hybrids with A188 as a parent are good sources of donor material for 
transformation (Fromm et al. 1990 Biotech 8:833-839), but other genotypes can be 
used successfully as well. Ears are harvested from corn plants at approximately 1 1 
days after pollination (DAP) when the length of immature embryos is about 1 to 1 .2 
mm. Immature embryos are co-cultivated with Agrobacterium tumefaciens that carry 
"super binary" vectors and transgenic plants are recovered through organogenesis. 
The super binary vector system of Japan Tobacco is described in WO patents 
WO94/00977 and WO95/06722. Vectors can be constructed as described. Various 
selection marker genes can be used including the maize gene encoding a mutated 
acetohydroxy acid synthase (AHAS) enzyme (US patent 6025541). Similarly, various 
promoters can be used to regulate the trait gene to provide constitutive, developmental, 
tissue or environmental regulation of gene transcription. In this example, the 34S 
promoter (GenBank Accession numbers M59930 and X16673) is used to provide 
constitutive expression of the trait gene. 

[0547.0.0.0] Excised embryos are grown on callus induction medium, then maize 
regeneration medium, containing imidazolinone as a selection agent. The Petri plates 
are incubated in the light at 25 °C for 2-3 weeks, or until shoots develop. The green 
shoots are transferred from each embryo to maize rooting medium and incubated at 25 
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°C for 2-3 weeks, until roots develop. The rooted shoots are transplanted to soil in the 
greenhouse. T1 seeds are produced from plants that exhibit tolerance to the 
imidazolinone herbicides and which are PCR positive for the transgenes. 

[0548.0.0.0] The T1 generation of single locus insertions of the T-DNA can segregate 
5 for the transgene in a 3: 1 ratio. Those progeny containing one or two copies of the 
transgene are tolerant of the imidazolinone herbicide. Homozygous T2 plants can 
exhibited similar phenotypes as the T1 plants. Hybrid plants (F1 progeny) of 
homozygous transgenic plants and non-transgenic plants can also exhibited increased 
similar phenotyps. 

10 [0549.0.0.0] Example 14d: Engineering wheat plants by over-expressing 

YNL090W, e.g. from Saccharomyces cerevisiae or plants 

[0550.0.0.0] Transformation of wheat is performed with the method described by 
Ishida et al. (1996 Nature Biotech. 14745-50). The cultivar Bobwhite (available from 
CYMMIT, Mexico) is commonly used in transformation. Immature embryos are co- 

1 5 cultivated with Agrobacterium tumefaciens that carry "super binary" vectors, and 
transgenic plants are recovered through organogenesis. The super binary vector 
system of Japan Tobacco is described in WO patents WO94/00977 and WO95/06722. 
Vectors were constructed as described. Various selection marker genes can be used 
including the maize gene encoding a mutated acetohydroxy acid synthase (AHAS) 

20 enzyme (US patent 6025541). Similarly, various promoters can be used to regulate the 
trait gene to provide constitutive, developmental, tissue or environmental regulation of 
gene transcription. In this example, the 34S promoter (GenBank Accession numbers 
M59930 and X16673) can be used to provide constitutive expression of the trait gene. 

[0551 .0.0.0] After incubation with Agrobacterium, the embryos are grown on callus 
25 induction medium, then regeneration medium, containing imidazolinone as a selection 
agent. The Petri plates are incubated in the light at 25 °C for 2-3 weeks, or until shoots 
develop. The green shoots are transferred from each embryo to rooting medium and 
incubated at 25 °C for 2-3 weeks, until roots develop. The rooted shoots are 
transplanted to soil in the greenhouse. T1 seeds are produced from plants that exhibit 
30 tolerance to the imidazolinone herbicides and which are PCR positive for the 
transgenes. 

[0552.0.0.0] The T1 generation of single locus insertions of the T-DNA can segregate 
for the transgene in a 3:1 ratio. Those progeny containing one or two copies of the 
transgene are tolerant of the imidazolinone herbicide. Homozygous T2 plants exhibited 
35 similar phenotypes. 
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[0553.0.0.0] Example 14e: Engineering Rapeseed/Canola plants by over-expressing 

YNL090W, e.g. from Saccharomyces cerevisiae or plants 

[0554.0.0.0] Cotyledonary petioles and hypocotyls of 5-6 day-old young seedlings 
are used as explants for tissue culture and transformed according to Babic et al. (1998, 
5 Plant Cell Rep 17: 183-188). The commercial cultivar Westar (Agriculture Canada) is 
the standard variety used for transformation, but other varieties can be used. 

[0555.0.0.0] Agrobacterium tumefaciens LBA4404 containing a binary vector are 
used for canola transformation. Many different binary vector systems have been 
described for plant transformation (e.g. An, G. in Agrobacterium Protocols. Methods in 

10 Molecular Biology vol 44, pp 47-62, Gartland KMA and MR Davey eds. Humana Press, 
Totowa, New Jersey). Many are based on the vector pBIN19 described by Bevan 
(Nucleic Acid Research. 1984. 12:8711-8721) that includes a plant gene expression 
cassette flanked by the left and right border sequences from the Ti plasmid of 
Agrobacterium tumefaciens. A plant gene expression cassette consists of at least two 

15 genes - a selection marker gene and a plant promoter regulating the transcription of 
the cDNA or genomic DNA of the trait gene. Various selection marker genes can be 
used including the Arabidopsis gene encoding a mutated acetohydroxy acid synthase 
(AHAS) enzyme (US patents 57673666 and 6225105). Similarly, various promoters 
can be used to regulate the trait gene to provide constitutive, developmental, tissue or 

20 environmental regulation of gene transcription. In this example, the 34S promoter 
(GenBank Accession numbers M59930 and X1 6673) can be used to provide 
constitutive expression of the trait gene. 

[0556.0.0.0] Canola seeds are surface-sterilized in 70% ethanol for 2 min., and then 
in 30% Clorox with a drop of Tween-20 for 1 0 min, followed by three rinses with 

25 sterilized distilled water. Seeds are then germinated in vitro 5 days on half strength MS 
medium without hormones, 1% sucrose, 0.7% Phytagar at 23oC, 16 hr. light. The 
cotyledon petiole explants with the cotyledon attached are excised from the in vitro 
seedlings, and are inoculated with Agrobacterium by dipping the cut end of the petiole 
explant into the bacterial suspension. The explants are then cultured for 2 days on 

30 MSBAP-3 medium containing 3 mg/l BAP, 3 % sucrose, 0.7 % Phytagar at 23 °C, 16 hr 
light. After two days of co-cultivation with Agrobacterium, the petiole explants are 
transferred to MSBAP-3 medium containing 3 mg/l BAP, cefotaxime, carbenicillin, or 
timentin (300 mg/l) for 7 days, and then cultured on MSBAP-3 medium with cefotaxime, 
carbenicillin, or timentin and selection agent until shoot regeneration. When the shoots 

35 are 5-10 mm in length, they are cut and transferred to shoot elongation medium 

(MSBAP-0.5, containing 0.5 mg/l BAP). Shoots of about 2 cm in length are transferred 
to the rooting medium (MS0) for root induction. 

[0557.0.0.0] Samples of the primary transgenic plants (TO) are analyzed by PCR to 
confirm the presence of T-DNA. These results are confirmed by Southern hybridization 
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in which DNA is electrophoresed on a 1 % agarose ge! and are transferred to a 
positively charged nylon membrane (Roche Diagnostics), The PCR DIG Probe 
Synthesis Kit (Roche Diagnostics) is used to prepare a digoxigenin-labelled probe by 
PCR, and used as recommended by the manufacturer. 

5 [0558.0.0.01 Example 14f: Engineering alfalfa plants by over-expressing 

YNL090W genes, e.g. from Saccharomyces cerevisiae 
or E. coli or plants 

[0559.0.0.0] A regenerating clone of alfalfa (Medicago sativa) is transformed using 
the method of (McKersie et al., 1999 Plant Physiol 119: 839-847). Regeneration and 

10 transformation of alfalfa is genotype dependent and therefore a regenerating plant is 
required. Methods to obtain regenerating plants have been described. For example, 
these can be selected from the cultivar Rangelander (Agriculture Canada) or any other 
commercial alfalfa variety as described by Brown DCW and A Atanassov (1985. Plant 
Cell Tissue Organ Culture 4: 1 1 1-1 12). Alternatively, the RA3 variety (University of 

15 Wisconsin) has been selected for use in tissue culture (Walker et al., 1978 Am J Bot 
65:654-659). 

[0560.0.0.0] Petiole explants are cocultivated with an overnight culture of 
Agrobacterium tumefaciens C58C1 pMP90 (McKersie et al., 1999 Plant Physiol 1 19: 
839-847) or LBA4404 containing a binary vector. Many different binary vector systems 

20 have been described for plant transformation (e.g. An, G. in Agrobacterium Protocols. 
Methods in Molecular Biology vol 44, pp 47-62, Gartland KMA and MR Davey eds. 
Humana Press, Totowa, New Jersey). Many are based on the vector pBIN19 described 
by Bevan (Nucleic Acid Research. 1984. 12:8711-8721) that includes a plant gene 
expression cassette flanked by the left and right border sequences from the Ti plasmid 

25 of Agrobacterium tumefaciens. A plant gene expression cassette consists of at least 
two genes - a selection marker gene and a plant promoter regulating the transcription 
of the cDNA or genomic DNA of the trait gene. Various selection marker genes can be 
used including the Arabidopsis gene encoding a mutated acetohydroxy acid synthase 
(AHAS) enzyme (US patents 57673666 and 6225105). Similarly, various promoters 

30 can be used to regulate the trait gene that provides constitutive, developmental, tissue 
or environmental regulation of gene transcription. In this example, the 34S promoter 
(GenBank Accession numbers M59930 and X16673) can be used to provide 
constitutive expression of the trait gene. 

[0561.0.0.0] The explants are cocultivated for 3 d in the dark on SH induction 
35 medium containing 288 mg/ L Pro, 53 mg/ L thioproline, 4.35 g/ L K2S04, and 100 jjm 
acetosyringinone. The explants are washed in half-strength Murashige-Skoog medium 
(Murashige and Skoog, 1962) and plated on the same SH induction medium without . 
acetosyringinone but with a suitable selection agent and suitable antibiotic to inhibit 
Agrobacterium growth. After several weeks, somatic embryos are transferred to BOi2Y 
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development medium containing no growth regulators, no antibiotics, and 50 g/ L 
sucrose. Somatic embryos are subsequently germinated on half-strength Murashige- 
Skoog medium. Rooted seedlings are transplanted into pots and grown in a 
greenhouse. 

5 [0562.0.0.0] The TO transgenic plants are propagated by node cuttings and rooted in 
Turface growth medium. The plants are defoliated and grown to a height of about 10 
cm (approximately 2 weeks after defoliation). 

[0563.0.0.0] Example 14g: Engineering alfalfa plants by over-expressing 

YLR375W genes, e.g. from Saccharomyces cerevisiae 
10 or plants 

[0564.0.0.0] A regenerating clone of alfalfa (Medicago sativa) is transformed using 
the method of (McKersie et al., 1999 Plant Physiol 119: 839-847). Regeneration and 
transformation of alfalfa is genotype dependent and therefore a regenerating plant is 
required. Methods to obtain regenerating plants have been described. For example, 
15 these can be selected from the cultivar Rangelander (Agriculture Canada) or any other 
commercial alfalfa variety as described by Brown DCW and A Atanassov (1985. Plant 
Cell Tissue Organ Culture 4: 1 11-112). Alternatively, the RA3 variety (University of 
Wisconsin) has been selected for use in tissue culture (Walker et al., 1978 Am J Bot 
65:654-659). 

20 [0565.0.0.0] Petiole explants are cocultivated with an overnight culture of 

Agrobacterium tumefaciens C58C1 pMP90 (McKersie et al., 1999 Plant Physiol 119: 
839-847) or LBA4404 containing a binary vector. Many different binary vector systems 
have been described for plant transformation (e.g. An, G. in Agrobacterium Protocols. 
Methods in Molecular Biology vol 44, pp 47-62, Gartland KMA and MR Davey eds. 

25 Humana Press, Totowa, New Jersey). Many are based on the vector pBIN19 described 
by Bevan (Nucleic Acid Research. 1984. 12:871 1-8721) that includes a plant gene 
expression cassette flanked by the left and right border sequences from the Ti plasmid 
of Agrobacterium tumefaciens. A plant gene expression cassette consists of at least 
two genes - a selection marker gene and a plant promoter regulating the transcription 

30 of the cDNA or genomic DNA of the trait gene. Various selection marker genes can be 
used including the Arabidopsis gene encoding a mutated acetohydroxy acid synthase 
(AH AS) enzyme (US patents 57673666 and 6225105), Similarly, various promoters 
can be used to regulate the trait gene that provides constitutive, developmental, tissue 
or environmental regulation of gene transcription. In this example, the 34S promoter 

35 (GenBank Accession numbers M59930 and X16673) can be used to provide 
constitutive expression of the trait gene. 

[0566.0.0.0] The explants are cocultivated for 3 d in the dark on SH induction 
medium containing 288 mg/ L Pro, 53 mg/ L thioproline, 4.35 g/ L K2SQ4, and 100 |jm 
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acetosyringinone. The explants are washed in half-strength Murashige-Skoog medium 
(Murashige and Skoog, 1962) and plated on the same SH induction medium without 
acetosyringinone but with a suitable selection agent and suitable antibiotic to inhibit 
Agrobacterium growth. After several weeks, somatic embryos are transferred to BOi2Y 
5 development medium containing no growth regulators, no antibiotics, and 50 g/ L 
sucrose. Somatic embryos are subsequently germinated on half-strength Murashige- 
Skoog medium. Rooted seedlings are transplanted into pots and grown in a 
greenhouse. 

[0567.0.0.0] The TO transgenic plants are propagated by node cuttings and rooted in 
1 0 Turface growth medium. The plants are defoliated and grown to a height of about 1 0 
cm (approximately 2 weeks after defoliation). 

[0568.0.0.0] Equivalents 

[0569.0.0.0] Those of ordinary skill in the art will recognize, or will be able to 
ascertain using no more than routine experimentation, many equivalents to the specific 
15 embodiments of the invention described herein. Such equivalents are intended to be 
encompassed by the claims. 
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1 . A process for the production of fine chemical, which comprises 

a) increasing or generating the biological activity represented by a protein as 
depicted in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 1 6, 1 8, 20, 22, 24, 26, 28, 30, 
32, 34, 36, 38, 40, 42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 

174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 

204, 206, 208, 21 0, 21 2, 21 4, 21 6, 21 8, 220, 222, 224, 226, 228, 230, 232, 
234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 
294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 
324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 
354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 
384, 386, 388, 390, 392 or 394 in a non-human organism, or in one or more 
parts thereof; and 

b) growing the organism under conditions which permit the production of the 
fine chemical in said organism. 

2. A process for the production of fine chemical, comprising the increasing or 
generating in an organism or a part thereof the expression of at least one nucleic 
acid molecule comprising a nucleic acid molecule selected from the group 
consisting of : 

a) nucleic acid molecule encoding of the polypeptide as depicted in SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 
42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 
90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 
182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 
212, 21 4, 21 6, 21 8, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 
242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 
302, 304, 306, 308, 31 0, 31 2, 31 4, 31 6, 31 8, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 
362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 
392 or 394 or a fragment thereof, which confers an increase in the amount 
of fine chemical in an organism or a part thereof; 
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b) nucleic acid molecule comprising of the nucleic acid molecule as depicted in 
SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 
37, 39, 41 43, 45, 55, 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75. 77, 79, 81 , 83, 
85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
5 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 

149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 
179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 
209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 
239, 241 , 243, 245, 247, 249, 251 , 253, 255, 257, 259, 261 , 263, 265, 267, 
1 0 269, 271 , 273, 275, 277, 279, 281 , 283, 285, 287, 289, 291 , 293, 295, 297, 

299, 301 , 303, 305, 307, 309, 31 1 , 31 3, 31 5, 31 7, 31 9, 321 , 323, 325, 327, 
329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 
389, 391 or 393; 

15 c) nucleic acid molecule whose sequence can be deduced from a polypeptide 

sequence encoded by a nucleic acid molecule of (a) or (b) as a result of the 
degeneracy of the genetic code and conferring an increase in the amount of 
fine chemical in an organism or a part thereof; 

d) nucleic acid molecule which encodes a polypeptide which has at least 50% 
20 identity with the amino acid sequence of the polypeptide encoded by the 

nucleic acid molecule of (a) to (c) and conferring an increase In the amount 
of fine chemical in an organism or a part thereof; 

e) nucleic acid molecule which hybridizes with a nucleic acid molecule of (a) to 
(c) under stringent hybridization conditions and conferring an increase in the 

25 amount of fine chemical in an organism or a part thereof; 

f) nucleic acid molecule which encompasses a nucleic acid molecule which is 
obtained by amplifying nucleic acid molecules from a cDNA library or a 
genomic library using the primers in SEQ ID NO: 53 or SEQ ID NO: 54 and 
conferring an increase in the amount of the fine chemical in an organism or 

30 a part thereof ; 

g) nucleic acid molecule encoding a polypeptide which is isolated with the aid 
of monoclonal antibodies against a polypeptide encoded by one of the 
nucleic acid molecules of (a) to (f) and conferring an increase in the amount 
of fine chemical in an organism or a part thereof; 

35 h) nucleic acid molecule encoding a polypeptide comprising the consensus 

sequence as depicted in SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, 
SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID 
NO: 398, SEQ ID NO: 399 and/or SEQ ID NO: 400 and conferring an 
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increase in the amount of the fine chemical in an organism or a part thereof; 
and 

i) nucleic acid molecule which is obtainable by screening a suitable nucleic 
acid library under stringent hybridization conditions with a probe comprising 
one of the sequences of the nucleic acid molecule of (a) to (k) or with a 
fragment thereof having at least 1 5 nt, preferably 20 nt, 30 nt, 50 nt, 1 00 nt, 
200 nt or 500 nt of the nucleic acid molecule characterized in (a) to (k) and 
conferring an increase in the amount of the fine chemical in an organism or 
a part thereof. 

or comprising a sequence which is complementary thereto. 

3. The process of claim 1 or 2, comprising recovering of the free or bound fine 
chemical. 

4. The process of any one of claim 1 to 3, comprising the following steps: 

(a) selecting an organism or a part thereof expressing a polypeptide encoded 
by the nucleic acid molecule characterized in claim 2; 

(b) mutagenizing the selected organism or the part thereof; 

(c) comparing the activity or the expression level of said polypeptide in the 
mutagenized organism or the part thereof with the activity or the expression 
of said polypeptide of the selected organisms or the part thereof; 

(d) selecting the mutated organisms or parts thereof, which comprise an 
increased activity or expression level of said polypeptide compared to the 
selected organism or the part thereof; 

(e) optionally, growing and cultivating the organisms or the parts thereof; and 

(f) recovering, and optionally isolating, the free or bound fine chemical 
produced by the selected mutated organisms or parts thereof. 

5. The process of any one of claims 1 to 4, wherein the activity of said protein or the 
expression of said nucleic acid molecule is increased or generated transiently or 
stably. 

6. An isolated nucleic acid molecule comprising a nucleic acid molecule selected 
from the group consisting of: 

(a) nucleic acid molecule encoding of the polypeptide as depicted in SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 
42, 44, 46, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 
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90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 162, 164. 166, 168, 170, 172, 174, 176, 178, 180, 
1 82, 1 84, 1 86, 1 88, 1 90, 1 92, 1 94, 1 96, 1 98, 200, 202, 204, 206, 208, 21 0, 
5 21 2, 21 4, 216, 21 8, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 

242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 
302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 
■I o 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 

392 or 394 or a fragment thereof, which confers an increase in the amount 
of fine chemical in an organism or a part thereof; 

' (b) nucleic acid molecule comprising of the nucleic acid molecule as depicted in 
SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 

1 5 37, 39, 41 43, 45, 55, 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75, 77, 79, 81 , 83, 

85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 
149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 
179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 

20 209, 21 1 , 21 3, 21 5, 21 7, 21 9, 221 , 223, 225, 227, 229, 231 , 233, 235, 237, 

239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 
269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 
299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 
329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 

25 359, 361 , 363, 365, 367, 369, 371 , 373, 375, 377, 379, 381 , 383, 385, 387, 

389, 391 or 393 or a fragment thereof, which confers an increase in the 
amount of fine chemical in an organism or a part thereof; 

(c) nucleic acid molecule whose sequence can be deduced from a polypeptide 
sequence encoded by a nucleic acid molecule of (a) or (b) as a result of the 

30 degeneracy of the genetic code and conferring.an increase in the amount of 

fine chemical in an organism or a part thereof; 

(d) nucleic acid molecule which encodes a polypeptide which has at least 50% 
identity with the amino acid sequence of the polypeptide encoded by the 
nucleic acid molecule of (a) to (c) and conferring an increase in the amount 

35 of fine chemical in an organism or a part thereof; 

(e) nucleic acid molecule which hybridizes with a nucleic acid molecule of (a) to 
(c) under stringent hybridization conditions and conferring an increase in the 
amount of fine chemical in an organism or a part thereof; 
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(f) nucleic acid molecule which encompasses a nucleic acid molecule which is 
obtained by amplifying nucleic acid molecules from a cDNA library or a 
genomic library using the primers in SEQ ID NO: 53 or SEQ ID NO: 54 and 
conferring an increase in the amount of the fine chemical in an organism or 
a part thereof ; 

(g) nucleic acid molecule encoding a polypeptide which is isolated with the aid 
of monoclonal and/or polyclonal antibodies against a polypeptide encoded 
by one of the nucleic acid molecules of (a) to (f) and conferring an increase 
in the amount of fine chemical in an organism or a part thereof; 

(h) nucleic acid molecule encoding a polypeptide comprising the consensus 
sequence as depicted in SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, 
SEQ ID NO: 50, SEQ ID NO: 51 , SEQ ID NO: 52, SEQ ID NO: 397, SEQ ID 
NO: 398, SEQ ID NO: 399 and/or SEQ ID NO: 400 and conferring an 
increase in the amount of the fine chemical in an organism or a part thereof; 
and 

(i) nucleic acid molecule which is obtainable by screening a suitable nucleic 
acid library under stringent hybridization conditions with a probe comprising 
one of the sequences of the nucleic acid molecule of (a) to (k) or with a 
fragment thereof having at least 15 nt, preferably 20 nt, 30 nt, 50 nt, 100 nt, 
200 nt or 500 nt of the nucleic acid molecule characterized in (a) to (k) and 
conferring an increase in the amount of the fine chemical in an organism or 
a part thereof, 

whereby the nucleic acid molecule distinguishes over the sequence as depicted 
in SEQ ID NO: 1 , 55, 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75, 77, 79, 81 , 83, 85, 
87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 
123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 
1 87, 1 89, 1 91 , 1 93, 1 95, 1 97, 1 99, 201 , 203, 205, 207, 209, 211,21 3, 21 5, 21 7, 
219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 
251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 
283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 
31 5, 31 7, 31 9, 321 , 323, 325, 327, 329, 331 , 333, 335, 337, 339, 341 , 343, 345, 
347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 
379, 381 , 383, 385, 387, 389, 391 or 393 by one or more nucleotides. 

7. A nucleic acid construct which confers the expression of the nucleic acid 
molecule of claim 6, comprising one or more regulatory elements. 



8. A vector comprising the nucleic acid molecule as claimed in claim 6 or the nucleic 
acid construct of claim 7. 
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9. The vector as claimed in claim 8, wherein the nucleic acid molecule is in 
operable linkage with regulatory sequences for the expression in a prokaryotic or 
eukaryotic, or in a prokaryotic and eukaryotic, host. 

10. A host cell, which has been transformed stably or transiently with the vector as 
5 claimed in claim 8 or 9 or the nucleic acid molecule as claimed in claim 6 or the 

nucleic acid construct of claim 7 or produced as described in claim any one of 
claims 2 to 4. 

1 1 . The host cell of claim 1 0, which is a transgenic host cell. 

12. The host cell of claim 10 or 1 1 , which is a plant cell, an animal cell, a 

1 0 microorganism, or a yeast cell, a fungus cell, a prokaryotic cell, an eukaryotic cell 

or an archaebacterium. 

13. A process for producing a polypeptide, wherein the polypeptide is expressed in a 
host cell as claimed in any one of claims 9 to 1 1 . 

14. A polypeptide produced by the process as claimed in claim 13 or encoded by the 
15 nucleic acid molecule as claimed in claim 6 whereby the polypeptide 

distinguishes over the sequence as depicted in SEQ ID NO: 2, 56, 58, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 
138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 

20 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 

202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 
234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 
266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 

25 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 

362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392 
or 394 by one or more amino acids. 

15. An antibody, which binds specifically to the polypeptide encoded by a nucleic 
acid sequence as claimed in claim 6 a). 

30 1 6. A plant tissue, propagation material, harvested material or a plant comprising the 
host cell as claimed in claim 12, which is plant cell or an Agrobacterium. 

17. A method for screening for agonists and antagonists of the activity of a 
polypeptide encoded by the nucleic acid molecule of claim 6 conferring an 
increase in the amount of fine chemical in an organism or a part thereof 
35 comprising: 
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(a) contacting cells, tissues , plants or microorganisms which express the a 
polypeptide encoded by the nucleic acid molecule of claim 6 conferring an 
increase in the amount of the fine chemical in an organism or a part thereof 
with a candidate compound or a sample comprising a plurality of 

5 compounds under conditions which permit the expression the polypeptide; 

(b) assaying the fine chemical level or the polypeptide expression level in the 
cell, tissue, plant or microorganism or the media the cell, tissue, plant or 
microorganisms is cultured or maintained in; and 

(c) identifying a agonist or antagonist by comparing the measured fine 

10 chemical level or polypeptide expression level with a standard fine chemical 

or polypeptide expression level measured in the absence of said candidate 
compound or a sample comprising said plurality of compounds, whereby an 
increased level over the standard indicates that the compound or the 
sample comprising said plurality of compounds is an agonist and a 

1 5 decreased level over the standard indicates that the compound or the 

sample comprising said plurality of compounds is an antagonist. 

18. A process for the identification of a compound conferring increased fine chemical 
production in a plant or microorganism, comprising the steps: 

(a) culturing a plant cell or tissue or microorganism or maintaining a plant 

20 expressing the polypeptide encoded by the nucleic acid molecule of claim 6 

conferring an increase in the amount of the fine chemical in an organism or 
a part thereof and a readout system capable of interacting with the 
polypeptide under suitable conditions which permit the interaction of the 
polypeptide with dais readout system in the presence of a compound or a 

25 sample comprising a plurality of compounds and capable of providing a 

detectable signal in response to the binding of a compound to said 
polypeptide under conditions which permit the expression of said readout 
system and of the polypeptide encoded by the nucleic acid molecule of 
claim 6 conferring an increase in the amount of the fine chemical in an 

30 organism or a part thereof; 

(b) identifying if the compound is an effective agonist by detecting the presence 
or absence or increase of a signal produced by said readout system. 

19. A method for the identification of a gene product conferring an increase in the fine 
chemical production in a cell, comprising the following steps: 

35 (a) contacting the nucleic acid molecules of a sample, which can contain a 

candidate gene encoding a gene product conferring an increase in fine 
chemical after expression with the nucleic acid molecule of claim 6; 
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(b) identifying the nucleic acid molecules, which hybridize under relaxed 
stringent conditions with the nucleic acid molecule of claim 6; 

(c) introducing the candidate nucleic acid molecules in host cells appropriate 
for producing the fine chemical; 

. (d) expressing the identified nucleic acid molecules in the host cells; 

(e) assaying the fine chemical level in the host cells; and 

(f) identifying nucleic acid molecule and its gene product which expression 
confers an increase in the fine chemical level in the host cell in the host cell 
after expression compared to the wild type. 

20. A method for the identification of a gene product conferring an increase in fine 
chemical production in a cell, comprising the following steps: 

(a) identifiying in a data bank nucleic acid molecules of an organism; which can 
contain a candidate gene encoding a gene product conferring an increase 
in the fine chemical amount or level in an organism or a part thereof after 
expression, and which are at least 30% homolog to the nucleic acid 
molecule of claim 6; 

(b) introducing the candidate nucleic acid molecules in host cells appropriate 
for producing the fine chemical; 

(c) expressing the identified nucleic acid molecules in the host cells; 

(d) assaying the fine chemical level in the host cells; and 

(e) identifying nucleic acid molecule and its gene product which expression 
confers an increase in the fine chemical level in the host cell after 
expression compared to the wild type. 

21 . A method for the production of an agricultural composition comprising the steps 
of the method of any one of claims 17 to 20 and formulating the compound 
identified in any one of claims 17 to 20 in a form acceptable for an application in 
agriculture. 

22. A composition comprising the nucleic acid molecule of claim 6, the polypeptide of 
claim 14, the nucleic acid construct of claim 7, the vector of any one of claims 8 
or 9, an antagonist or agonist identified according to claim 17, the compound of 
claim 18, the gene product of claim 19 or 20, the antibody of claim 15, and 
optionally an agricultural acceptable carrier. 
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23. Use of the nucleic acid molecule as claimed in claim 6 for the identification of a 
nucleic acid molecule conferring an increase of the fine chemical after 
expression. 

24. Use of the polypeptide of claim 1 4 or the nucleic acid construct claim 7 or the 

5 gene product identified according to the method of claim 18 or 19 for identifying 

compounds capable of conferring a modulation of the fine chemical levels in an 
organism. 

25. Food or feed composition comprising the nucleic acid molecule of claim 6, the 
polypeptide of claim 1 4, the nucleic acid construct of claim 7, the vector of claim 

10 8 or 9, the antagonist or agonist identified according to claim 17, the antibody of 

claim 15, the plant or plant tissue of claim 16, the harvested material of claim 16, 
the host cell of claim 10 to 12 or the gene product identified according to the 
method of claim 1 9 or 20. 



26. 

15 



Use of the nucleic acid molecule as claimed in claim 6 in mapping and breeding 
processes for the identification of plant varieties having and increased capacity 
for production of the fine chemical. 
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SEQUENCE LISTING 



IAP9 Rec'dPCWlff 3 f JAN 6 20(j5 

PCT/EP2004/008136 «JWJ 



<110> BASF Plant Science GmbH 

<120> Process for the production of fine chemicals 

<130> PF55740 

<140> 2004_299 

<141> 2004-08-10 



<160> 400 

<170> Patentln version 3.1 



<210> 1 

<211> 579 

<212> DNA 

<213> Saccharomyces cerevisiae 



<220> 

<221> CDS 

<222> (1)..(579) 

<223> 

<400> 1 

atg tct gaa aag gcc gtt aga agg aaa ctt gtt att att ggt gat ggt 48 

Met Ser Glu Lys Ala Val Arg Arg Lys Leu Val lie He Gly Asp Gly 

1 5 10 15 

get tgt ggc aag acc tct tta eta tat gta ttt aca tta gga aaa ttc 96 
Ala Cys Gly Lys Thr Ser Leu Leu Tyr Val Phe Thr Leu Gly Lys Phe 
20 25 30 

cct gaa caa tat cat ccg aca gtg ttc gag aat tat gtc act gat tgc 144 
Pro Glu Gin Tyr His Pro Thr Val Phe Glu Asn Tyr Val Thr Asp Cys 
35 40 45 

aga gtt gac gga ata aaa gtg tec tta acg etc tgg gat aca gcg gga 192 
Arg Val Asp Gly He Lys Val Ser Leu Thr Leu Trp Asp Thr Ala Gly 
50 55 60 

caa gag gaa tat gaa cgt tta cgt cca ttc tea tat tea aaa gca gat 240 
Gin Glu Glu Tyr Glu Arg Leu Arg Pro Phe Ser Tyr Ser Lys Ala Asp 
65 70 75 80 

ata ata tta att ggg ttt get gta gac aat ttt gaa tea eta att aac 288 
lie lie Leu He Gly Phe Ala Val Asp Asn Phe Glu Ser Leu He Asn 
85 90 95 

gca agg acg aaa tgg gcg gat gag gca tta cga tat tgt cct gac gca 336 
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Ala Arg Thr Lys Trp Ala Asp Glu Ala Leu Arg Tyr Cys Pro Asp Ala 
• 100 105 110 

cca ate gtt ctt gta ggc ttg aaa aaa gat ttg agg caa gaa gec cat 384 
Pro lie Val Leu Val Gly Leu Lys Lys Asp Leu Arg Gin Glu Ala His 
115 120 125 

ttt aaa gag aat get acc gat gaa atg gtt ccc att gaa gat gca aaa 432 
Phe Lys Glu Asn Ala Thr Asp Glu Met Val Pro lie Glu Asp Ala Lys 
130 135 140 



caa gtt gca agg gec att ggg gec aag aaa tac atg gaa tgt agt gca 
Gin Val Ala Arg Ala lie Gly Ala Lys Lys Tyr Met Glu Cys Ser Ala 
145 " 150 155 160 



taa 

<210> 2 

<211> 192 

<212> PRT 

<213> Saccharomyces cerevisiae 

<400> 2 

Met Ser Glu Lys Ala Val Arg Arg Lys Leu Val He He Gly Asp Gly 
1 s 10-15 

Ala Cys Gly Lys Thr Ser Leu Leu Tyr Val Phe Thr Leu Gly Lys Phe 
20 25 30 

Pro Glu Gin Tyr His Pro Thr Val Phe Glu Asn Tyr Val Thr Asp Cys 
35 40 45 

Arg Val Asp Gly He Lys Val Ser Leu Thr Leu Trp Asp Thr Ala Gly 
50 55 60 

Gin Glu Glu Tyr Glu Arg Leu Arg Pro Phe Ser Tyr Ser Lys Ala. Asp 
65 70 75 80 

He He Leu He Gly Phe Ala Val Asp Asn Phe Glu Ser Leu He Asn 
85 90 95 

Ala Arg Thr Lys Trp Ala Asp Glu Ala Leu Arg Tyr Cys Pro Asp Ala 
100 105 HO 



480 



ctg act ggt gag ggt gtg gat gat gtc ttt gaa gta get aca aga acc 528 
Leu Thr Gly Glu Gly Val Asp Asp Val Phe Glu Val Ala Thr Arg Thr 
165 ' ' 170 175 

agt ttg ctt atg aag aag gaa cca ggg get aac tgt tgc at a att tta 576 
Ser Leu Leu Met Lys Lys Glu Pro Gly Ala Asn Cys Cys He He Leu 
180 185 190 



579 



Pro He Val Leu Val Gly Leu Lys Lys Asp Leu Arg Gin Glu Ala His 
115 120 125 
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Phe Lys Glu Asn Ala Thr Asp Glu Met Val Pro lie Glu Asp Ala Lys 
130 135 140 



Gin Val Ala Arg Ala lie Gly Ala Lys Lys Tyr Met Glu Cys Ser Ala 
145 150 155 160 



Leu Thr Gly Glu Gly Val Asp Asp Val Phe Glu Val Ala Thr Arg Thr 
165 170 175 



Ser Leu Leu Met Lys Lys Glu Pro Gly Ala Asn Cys Cys lie lie Leu 
180 185 190 



<210> 3 

<211> 600 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> CDS 

<222> (1) .". (600) 

<223> 



<400> 3 

atg ggt tgc tec tec tec gtg cca get cga age act gga ggg ttg aac 48 

Met Gly Cys Ser Ser Ser Val Pro Ala Arg Ser Thr Gly Gly Leu Asn 
1 5 10 15 

aat att age aac gat aac tec get act gat tea aag gac ttg cgt get 96 
Asn lie Ser Asn Asp Asn Ser Ala Thr Asp Ser Lys Asp Leu Arg Ala 
20 25 30 

aag ttg gta ttg ctt ggt gac tct ggt gta ggg aaa agt tgc att gtt 144 
Lys Leu Val Leu Leu Gly Asp Ser Gly Val Gly Lys Ser Cys lie Val 
35 40 45 

ctt cgc ttt gtt cgt ggt cag ttt gat ccc act tec aag gta act gtc 192 
Leu Arg Phe Val Arg Gly Gin Phe Asp Pro Thr Ser Lys Val Thr Val 
50 55 60 

ggt gca tea ttt tta tea caa aca ttg get ttg gag gac tea aca ata 240 
Gly Ala Ser Phe Leu Ser Gin Thr Leu Ala Leu Glu Asp Ser Thr lie 
65 70 75 80 

gtg aaa ttt gaa ata tgg gat acc get gga caa gag agg tat get gee 288 
Val Lys Phe Glu lie Trp Asp Thr Ala Gly Gin Glu Arg Tyr Ala Ala 
85 90 95 

ttg gca cct ctt tac tac aga gga get get get gca gtt gtt gtc tac 336 
Leu Ala Pro Leu Tyr Tyr Arg Gly Ala Ala Ala Ala Val Val Val Tyr 
100 " ~ 105 110 

gac ata act agt cca gaa tea ttt age aaa gca caa tac tgg gtg aag 384 
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Asp He Thr Ser Pro Glu Ser Phe Ser Lys Ala Gin Tyr Trp Val Lys 
115 120 125 



gaa ctt caa aaa cat ggt agt cct gat att ate atg gtt ttg gtt ggt 
Glu Leu Gin Lys His Gly Ser Pro Asp He He Met Val Leu Val Gly 
130 135 140 

aat aaa get gat eta cat gaa aat cga cat gta tct tct cag gaa gca 
Asn Lys Ala Asp Leu His Glu Asn Arg His Val Ser Ser Gin Glu Ala 
145 150 155 160 

caa gag tat gca gag aag aat aat atg gtt ttc ate gag aca tea gca 
Gin Glu Tyr Ala Glu Lys Asn Asn Met Val Phe He Glu Thr Ser Ala 
165 170 175 

aag aca get gat aat ata aac caa gta ttt gag gaa att gcg aag agg 
Lvs Thr Ala Asp Asn He Asn Gin Val Phe Glu Glu He Ala Lys Arg 
±80 • *" 185 130 

ttg ccc agg cca acg gcg tct tga 
Leu Pro Arg Pro Thr Ala Ser 
195 



<210> 4 

<211> 199 

<212> PRT 

<213> Oryza sativa 

<400> 4 

Met Gly Cys Ser Ser Ser Val Pro Ala Arg Ser Thr Gly Gly Leu Asn 
15 10 15 

Asn He Ser Asn Asp Asn Ser Ala Thr Asp Ser Lys Asp Leu Arg Ala 
20 25 30 

Lys Leu Val Leu Leu Gly Asp Ser Gly Val Gly Lys Ser Cys He Val 
35 40 45 

Leu Arg Phe Val Arg Gly Gin Phe Asp Pro Thr Ser Lys Val Thr Val 
50 55 60 

Gly Ala Ser Phe Leu Ser Gin Thr Leu Ala Leu Glu Asp Ser Thr He 
65 70 75 80 

val Lys Phe Glu He Trp Asp Thr Ala Gly Gin Glu Arg Tyr Ala Ala 
85 90 95 

Leu Ala Pro Leu Tyr Tyr Arg Gly Ala Ala Ala Ala Val Val Val Tyr 
100 105 HO 



432 



480 



528 



576 



600 



Asp He Thr Ser Pro Glu Ser Phe Ser Lys Ala Gin Tyr Trp Val Lys 
■ 115 120 125 
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Glu Leu Gin Lys His Gly Ser Pro Asp lie lie Met Val Leu Val Gly 
130 135 140 

Asn Lys Ala Asp Leu His Glu Asn Arg His Val Ser Ser Gin Glu Ala 
145 150 155 160 

Gin Glu Tyr Ala Glu Lys Asn Asn Met Val Phe lie Glu Thr Ser Ala 
165 170' 175 

Lys Thr Ala Asp Asn He Asn Gin Val Phe Glu Glu He Ala Lys Arg 
180 185 190 

Leu Pro Arg Pro Thr Ala Ser 
195 

<210> 5 

<211> 648 

<212> DNA 

<213> Oryza sativa 



<220> 

<221> CDS 

<222> (1) . . (648) 

<223> 



<400> 5 

atg gcg tec age gcg teg egg ttc ate aag tgc gtc acg gtc ggg gac 48 

Met Ala Ser Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp 

1 5 10 15 

ggc gee gtc ggc aag acc tgc atg etc ate tgc tac ace age aac aag 96 

Gly Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Ser Asn Lys 
20 25 30 

ttc ccc act gat tac gta ccc act gtt ttt gac aat ttc agt gca aac 144 

Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 
35 ' 40 45 

gtg gtg gtc gac ggc acc acg gtg aat ttg ggt etc tgg gat act gca 192 

Val Val Val Asp Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 

50 ' 55 60 

ggg cag gaa gat tac aac aga ttg agg ccg eta age tac cgt ggc gec 240 

Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 70 75 80 

gat gtc ttt gtg ctt gee ttc tec eta gtg age cga get age tat gag 288 

Asp Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser Tyr Glu 

85 90 95 

aat gtc atg aag aag tgg tta cca gag ctt cag cat tat gca cca ggg 336 

Asn Val Met Lys Lys Trp Leu Pro Glu Leu Gin His Tyr Ala Pro Gly 
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100 105 HO 

gtg cca att gtg ttg gtt ggg acc aaa ttg gat ctt cgt gaa gat aaa 384 
Val Pro lie Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys 
115 120 125 

cac tac tta ctt gac cat cct age ttg gtg cct gtg act aca gca cag 
His Tyr Leu Leu Asp His Pro Ser Leu Val Pro Val Thr Thr Ala Gin 
130 135 140 

aga gag gaa etc cgc aag cac att ggc gca acg tgt tac ate gaa tgc 
Gly Glu Glu Leu Arg Lys His lie Gly Ala Thr Cys Tyr lie Glu Cys 
145 150 155 160 

age tea aag aca cag cag aat gta aaa get gtg ttt gat get gee ate 
Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala lie 
165 170 175 

aag gta gta ate aag cct cca aca aag cag agg gac agg aag aag aag 
Lys Val Val lie Lys Pro Pro Thr Lys Gin Arg Asp Arg Lys Lys Lys 
180 185 190 

aaa aca egg cgc gga tgt tct ttc ttc tgc aag ggt gtc atg tec aga 624 
Lys Thr Arg Arg Gly Cys Ser Phe Phe Cys Lys Gly Val Met Ser Arg 
195 " 200 205 



aga agg eta gta tgc ttc aag tga 

Arg Arg Leu Val Cys Phe Lys 
210 * 215 



<210> 6 

<211> 215 

<212> PRT 

<213> Oryza sativa 

<400> 6 

Met Ala Ser Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp 
! ■ 5 10 15 

Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser Asn Lys 
20 25 30 

Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 
35 40 45 

Val Val Val Asp Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 
50 * 55 60 



Glv Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 70 75 80 



432 



4S0 



528 



576 



648 



Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser Tyr Glu 
85 90 95 
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Asn Val Met Lys Lys Trp Leu Pro Glu Leu Gin His Tyr Ala Pro Gly 
100 ,105 . 110 

Val Pro He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys 
115 120 125 

His Tyr Leu Leu Asp His Pro Ser Leu Val' Pro Val Thr Thr Ala Gin 
130 135 140 

Gly Glu Glu Leu Arg Lys His He Gly Ala Thr Cys Tyr He Glu Cys 
145 150 155 160 

Ser Ser Lys Thr Gin Gin Asn- Val Lys Ala Val Phe Asp Ala Ala He 
165 170 175 

Lys Val Val He Lys Pro Pro Thr Lys Gin Arg Asp Arg Lys Lys Lys 
180 185 190 

Lys Thr Arg Arg Gly Cys Ser Phe Phe Cys Lys Gly Val Met Ser Arg 
195 200 205 

Arg Arg Leu Val Cys Phe Lys 
210 215 

<210> 7 

<211> 591 

<212> DNA 

<213> Brassica napus 



<220> 

<221> CDS 

<222> (1) . . (591) 

<223> 



<400> 7 

atg age gca teg agg ttc ata aag tgt gtt aca gtc ggc gat ggt gec 48 
Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

gtc gga aaa acc tgt atg ctg ate tct tac acc age aac act ttc cct 96 
Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 

acg gac tat gtt cca act gtt ttc gac aac ttc agt get aac gtg gtt . 144 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

gtt gat ggg aac act gtg aat ctt gga ttg tgg gat aca get ggt caa 192 
Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
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50 55 60 

gaa gac tat aac agg tta aga cca ttg agt tac cgt ggt gca gat gtc 240 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

ttc att ctt get ttc tct ctt att age aaa get age tac gag aac ata 288 
Phe lie Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn He 
85 90 . 95 

gee aag aag tgg att cct gag etc agg cat tat gee cct gga gtt cct 336 
Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 HO 



ate att etc gtg ggg aca aaa etc gat ctt cga gat gac aag cag ttc 
lie He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 



gtg ctt cag cca cca aaa tea aag aag aag aaa aag aac aag aat cgt 
Val Leu Gin Pro Pro Lys Ser Lys Lys Lys Lys Lys Asn Lys Asn Arg 
180 185 190 

tgc gtt ttc ttg tga 
Cys val Phe Leu 
195 



<210> 8 

<211> 196 

<212> PRT 

<213> Brassica napus 

<400> 8 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
x 5 10 15 

Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 



384 



ttc ata gac cat ccc ggt gca gtg cca ate act aca aac cag gga gag 432 
Phe He Asp His Pro Gly Ala Val Pro He Thr Thr Asn Gin Gly Glu 
130 135 140 

gaa eta aag aaa etc ata gga tct cca gtt tac att gaa tgt agt tea 480 
Glu Leu Lys Lys Leu He Gly Ser Pro Val Tyr He Glu Cys Ser Ser 
145 150 155 160 

aag acg cag cag aat gtc aaa gca gtc ttt gac gca get att aaa gtg 528 
Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 



576 



591 



Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 



WO 2005/014828 
65 



70 



9/291 

75 



PCT/EP2004/008136 
80 



Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 



Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 110 



He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 " 120 " 125 



Phe He Asp His Pro Gly Ala Val Pro He Thr Thr Asn Gin Gly Glu 
130 135 140 



Glu Leu Lys Lys Leu He Gly Ser Pro Val Tyr He Glu Cys Ser Ser 
145 150 155 160 



Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 



Val Leu Gin Pro Pro Lys Ser Lys Lys Lys Lys Lys Asn Lys Asn Arg 
180 185 190 



Cys Val Phe Leu 
195 



<210> 9 
<211> . 597 
<212> DNA 

<213> Brassica napus 



<220> 

<221> CDS 

<222> (1)..(597) 

<223> 



<400> 9 

atg agt get teg agg ttt ate aag tgt gtc ace gtc ggc gac ggc get 48 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

gtc gga aag act tgt ctg etc ate tec tac act age aac act ttc ccc 96 
Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 .30 

acg gac tat gtg cca act gtg ttt gat aat ttc age gcg aat gtg att 144 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val He 
35 40 45 
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gtg gat ggg aac act ate aac ttg gga ttg tgg gat act gca ggg caa 
Val Lp Gly Asn Thr He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 6° 

gag gac tac aat aga eta aga cca ttg age tat cgc ggc gca gat gtc 
llu Lp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 

ttc tta etc get ttc tec ctt gtc age aaa get age tat gaa aat gtt 
Phe Leu Leu Ala Phe Ser Leu Val Ser Lys Ala Ser Tyr Glu Asn Val 
85 90 95 

tct aaa aaq tgg gta cct gaa ctg aga cat tat get cct ggt gtt oca 
ser Lys lys Tr? Val Pro llu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 

ate ate etc gtc gga acc aag ctt gat ctt cga gat gac aag caa ttc 
lie He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

n-h frt-h oacr cac cct aat act gtg cct ate tct act get cag ggt gaa 
Phe 52 llu His Pro Gly Ala Val Pro He Ser Thr Ala Gin Gly Glu 
130 135 1*0 

oaa cta =aa aaa atg att ggg gca cct get tat att gaa tgc agt gca 
SS Leu lyl SI Va! He Gly Ala Pro Ala Tyr He Glu Cys Ser Ala 
145 150 15 5 



aaa aea caa cag aat gta aaa gcg gtg ttt gat gcg get ate aag gta 
Asn 
165 



aaa aca caa cag aciu yua ^ — - - - - „ - 

Ss Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala lie Lys Val 
165 170 175 

gtt etc caa cca ccc aaa aac aag aag agg aag aag aga aag tct cag 
Val Leu Gin Pro Pro Lys Asn Lys Lys Arg Lys Lys Arg Lys Ser Gin 

185 190 



180 

aaa get tgt tct ata ttg tga 
Lys Ala Cys Ser lie Leu 
195 



<210> 10 

<211> 198 

<212> PRT 

<213> Brassica napus 

<400> 10 



Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
15 10 15 

val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 



192 



240 



288 



336 



384 



432 



480 



528 



576 



597 



Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val He 
35 40 4 5 
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Val Asp Gly Asn Thr lie Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 " 70 75 80 

Phe Leu Leu Ala Phe Ser Leu Val Ser Lys Ala Ser Tyr Glu Asn Val 
85 90 95 

Ser Lys Lys Trp Val Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 110 

lie lie Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 * ■ 125 " 

Phe Val Glu His Pro Gly Ala Val Pro He Ser Thr Ala Gin Gly Glu 
130 135 140 

Glu Leu Lys Lys Val He Gly Ala Pro Ala Tyr He Glu Cys Ser Ala 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala lie Lys Val 
165 170 175 

Val Leu Gin Pro Pro Lys Asn Lys Lys Arg Lys Lys Arg Lys Ser Gin 
180 185 190 

Lys Ala Cys Ser He Leu 
195 

<210> 11 

<211> 591 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (1) . . (591) 

<223> 



<400> 11 

atg agt acg gca agg ttt ate aag tgt gta aca gtt gga. gat ggt get 48 

Met Ser Thr Ala Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 



gtg gga aag aca tgc atg ctt ata tec tat ace age aat ace ttt ccc 
Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 



96 
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aog gat tat gtt cca aca gtg ttt gac aat ttc agt get aat gta acg 
Thr Lp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Thr 
35 40 45 

ata gat ggt agt act gtt aat ctt ggt tta tgg gac act gca gga caa 
Val £p Gly sir Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

craa aat tac aac agg eta agg cct tta age tat aga gga get gat gtg 
Ilu Asp Tyr Asn j£g Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 

70 75 



65 



ttt tta tta tgc tat tct etc ate age aaa gee agt tat gag aac ate 
Pne Leu Leu Cys Tyr Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 

»==, =. aC r t-aa ata cct aag eta aga cat tat get oca aat gtg cot 
ser Lys. Lys Tr? S Pro Ilu Leu Arg His Tyr Ala Pro Asn val Pro 
100 105 

ata ata eta gtg gga aca aaa eta gat ttg cga gat gac aag caa ttt 
S ?2 ^eu val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 X 

r*i-r* aat cat ccq crga tec gca cga ata aca act get cag ggt gaa 
Leu S IX Ss Pro Sy ser Ala Arg He Thr Thr Ala Gin Gly Glu 
130 135 

4-«-rr a^rr aaa ata att agt gca gtc act tat att gag tgc age tec 
if* Leu Lys Ss Me? S Sy Ala Val Thr Tyr He Glu Cys Ser Ser 
145 150 155 



s: s ss- a s» » S SS S2 Dp S Si s s s 

165 170 

a £5 si s s s s s s: s s ss sss 



180 

tgt act ttc etc tga 
Cys Thr Phe Leu 
195 



<210> 12 

<211> 196 

<212> PRT 

<213> Glycine max 



<400> 12 

Met Ser Thr Ala Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Thr 
35 40 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



591 
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Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 



Phe Leu Leu Cys Tyr Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 



Ser Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Asn Val Pro 
100 105 110 



He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

Leu lie Asp His Pro Gly Ser Ala Arg He Thr Thr Ala Gin Gly Glu 
130 135 140 



Glu Leu Lys Lys Met He Gly Ala Val Thr Tyr lie Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Leu Asn Val Lys Thr Val Phe Asp Ala Ala He Lys Val 
165 170 175 

Ala Leu Lys Pro Pro Lys Pro Lys Lys Lys Pro Arg Lys Lys Arg Thr 
180 185 190 



Cys Thr Phe Leu 
195 



<210> 13 

<211> 594 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (1) . . (594) 

<223> 



<400> 13 

atg age aca aca egg ttt att aag tgt gtc aca gtt ggt gat ggg get 48 
Met Ser Thr Thr Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

gtg gga aag act tgc atg ctt ate tct tac act age aac act ttc ccc 96 
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val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 



acg gat tat gtt cct aca gtt ttc gac aat ttc agt gca aat gtt gtg 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

gtt gat ggc age aca gtt aac ctg gga ttg tgg gac act get gga cag 
= a , * . Giy £cr Thr Val Asm Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 5 5 60 

gaa gat tac aac agg ctt agg cca ttg agt tac aga gga gca gat gtg 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp val 
65 70 75 

ttc ttq ctg gec ttt tec etc ate age aaa gec age tat gaa aat ata 
Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 9° 95 

tct aaa aag tgg att cct gaa ttg aga cat tat gec cca act gtg cct 
Ser Lyt Lys T g He Pro Glu Leu Arg His Tyr Ala Pro Thr Val Pro 
100 105 110 

att ata eta att aga act aaa ctt gat ttg agg gaa gac agg caa tat 
l" Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Arg Gin Tyr 
US * 120 125 

ttd att gat cat cct gga acc aca get ata get act gec cag gga gaa 
III lie Isp His Pre Sly Thr Thr Ala He Ala Thr Ala Gin Gly Glu 
130 135 I 40 

gaa ctg aag aag gca att ggt get get gtg tac ata gag tgc age tea 
Ilu Leu Lys Lys Ala He Gly Ala Ala Val Tyr He Glu Cys Ser Ser 
145 ISO 155 ibo 

aag act cag cag aat gtg aag gec gtg ttt gat get gca ate aag gtt 
Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 I 70 175 

att tta caa cca cct aag tec aag aaa aaa gga aag aag aag aac acg 
v£ LeS G?n Pro Pro Lys Ser Lys Lys Lys Gly Lys Lys Lys Asn Thr 
180 I 85 I 90 

cct tgt gtt ttc etc tga 
Pro Cys Val Phe Leu 





195 


<210> 


14 


<211> 


197 


<212> 


PRT 


<213> 


Glycine max 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



594 
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Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser. Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe Leu Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Thr Val Pro 
' 100 105 110 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Arg Gin Tyr 
115 120 125 

Leu lie Asp His Pro Gly Thr Thr Ala He Ala Thr Ala Gin Gly Glu 
130 ~ 135 140 

Glu Leu Lys Lys Ala He Gly Ala Ala Val Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 

Val Leu Gin Pro Pro Lys Ser Lys Lys Lys Gly Lys Lys Lys Asn Thr 
180 185 190 

Pro Cys Val Phe Leu 
195 

<210> 15 

<211> 591 

<212> DNA 

<213 > Glycine max 



<220> 

<221> CDS 

<222> (1) . . (591) 

<223> 



<400> 15 

atg agt gcg teg agg ttc att aag tgc gtc ace gtc ggc gac ggt get 48 
Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 
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, _ t.*.- *-i-rr act- fen tac acc age aac act ttt ccc 

JS S £ S S! S £ £ sS 3? Tar sir Asn ar fl. »ro 

20 25 

acg gac tat gtg ccc acc gtt ttt gac aat ttc agt get aat gtg gtg 
Tta Asp Tyr Val Pro Thr Val Phe Asp Asn Phe .Ser Ala Asn Val Val 
35 40 

ata crat gga age acc gta aac eta gga ttg' tgg gat aca get ggt cag 
val Asp Gly sir Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 50 

^rr ,«f tac aat ana tta aga ccc ttg age tat cga gga get gat gtc 
5S S P Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
G5 ™ 75 

ttc ata ctt qcc ttt tct etc ata age aag get age tat gaa aat att 
Pne iS Leu K Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
8 5 



S £ 31 £ £ £ E £ S S £ K £ © !S S 

100 105 

ata att etc gtt gga aca aag tta gat ctt egg gat gat aag caa ttt 
Se Se Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 



S2 £ £p s £ s IX £ S £ sss a s is 

130 • 135 140 

a £ a £ a £ s e s s £ £ ss £ £ s 

145 150 155 

£ S SS SI K £ £ S £ £ 5 IS IS £ £ £ 
!S £ SS £ £ £ 5 £ £ SSS S £ S 

180 I 85 190 

tgc tec ata tta tga 
Cys Ser He Leu 
195 

<210* 16 

<211> 196 

<212> PRT 

<213> Glycine max 



<400> 16 

Met Ser Ala Ser Arg Phe He Lys Cys Val Tor Val Gly Asp Gly Ala 
1 5 

Val Gly Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



591 



20 



25 
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Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 " 45 



Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 



Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 



Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 



Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 110 



He lie Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 



Phe Met Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 
130 " " 135 140 



Glu Leu Arg Lys Leu He Gly Ala Pro Ala Tyr He Glu Cys Ser Ser 
145 ~ 150 155 160 



Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 



Val He Gin Pro Pro Lys Leu Lys Lys Lys Arg Lys Thr Gin Lys Ala 
180 " 185 190 



Cys Ser He Leu 
195 



<210> 17 

<211> 594 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (1)..(594) 

<223> 



<400> 17 

atg age get tct agg ttc ate aag tgc gtc act gtt ggg gat ggt get 



48 
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Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

ata qac aaa acc tgt ttg ctt att tec tac acc age aac act ttt ccc 
Val Ily Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 

acc aat tat gtg ccg act gtt ttt gac aat ttc age gca aat gtg gtt 
Ill Tvr Val Pro Thr Val Phe Asp Asn. Phe Ser Ala Asn Val Val 
* 35 40 45 

gtc aat ggg age att gtg aat ctg ggt ttg tgg gat act get gga caa 
Val Asn Gly Ser He Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 "60 

gag gat tat aac aga tta aga cct ttg agt tac cgt ggt gee gat gtt 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 " 75 80 

ttc ata ctg get ttc tct etc ata age aag gec agt tat gaa aat gtc 
Pte He Leu La Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 
85 90 95 

tct aaa aag tgg att ccg gag ttg aag cat tat get cct ggt gtc ccc 
Ser Lys Lyl i% He Pro llu Leu Lys His Tyr Ala Pro Gly Val Pro 
100 105 

att att ctg gtt ggc aca aag ctt gac ctt egg gat gat aag cag ttc 
He lie Leu Val Ily Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

i-rrn art aac cat cct qgt gec gta cct att acc aca get cag gga gaa 
Ss lie Sp His Pro Ily Ala Val Pro lie Thr Thr Ala Gin Gly G.lu 

135 140 



130 



qag ctt agg aag ctg att aat gcg cca get tac att gaa tgc agt tea 
oX Leu A?| Lyi Leu He Asn Ala Pro Ala Tyr He Glu Cys Ser Ser 
145 150 155 160 

aaa aca caq qag aac gtg aag gca gtc ttt gat gca gec ata aga gtt 
gs Thr Gin Glu Asn Val Lyi Ala Val Phe Asp Ala Ala He Arg Val 
165 I 70 175 

gtc ctt caa cca cct aag cag aag aaa aag aag ggt aaa gca caa aag 
val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Gly Lys Ala Gin Lys 
180 185 19° 

gee tgt teg ata ttg tga 
Ala Cys Ser He Leu 
195 



<210> 18 

<211> 197 

<212> PRT 

<213> Glycine max 

<400> 18 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



594 
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Val Gly Lys Thr Cys lieu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 ' 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

Val Asn Gly Ser lie Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe lie Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn Val 
85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 
100 105 110 

lie He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

Cys He Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 
130 135 140 

Glu Leu Arg Lys Leu He Asn Ala Pro Ala Tyr lie Glu Cys Ser Ser 
145 ~ ISO 155 -160 

Lys Thr Gin Glu Asn Val Lys Ala Val Phe Asp Ala Ala He Arg Val 
165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Gly Lys Ala Gin Lys 
180 185 190 

Ala Cys Ser He Leu 
195 

<210> 19 

<211> 594 

<212> DNA 

<213> Glycine max 



<220> 

<221> CDS 

<222> (1)..(594) 

<223> 
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<400> 19 . rra^ <Klt occ 48 



Hi t cg tcc agg ttc ate aag tgt gtc act gtg ggt gac ggt gec 
Me? sir IS Ser Ar§ Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

m-i- nan aao act tac atg etc ate tcc tac acc age aac act ttt ect 
?£ Hy lyl Thr cys Me? Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 

25 



acg gac tac gtg cca act gtc ttt gac aaf ttc agt gca aat gtc gtt 
tS Isp Tyr val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 



gtg gat gga age act gtg aat ctt ggg ttg tgg gat act get ggc caa 
val IS ily sir Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 



aaa gat tac aat aga" ttg aga ccc tta age tat cgt gga gca gat gta 
Su Is"p T?r Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 

ttc ctg ctt get ttc tct etc ata age agg gec age tat gaa aat gtt 
Pne III Leu IL Phe Ser Leu He Ser Arg Ala Ser Tyr Glu Asn Val 
85 90 95 

acc aag aaa tgg att cct gag ttg agg eat. tat get cct ggt gtt cca 
IS Lys Lys Tr| lie Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 

105 ixu 



100 



att att ctt att gga aca aaa ctt gat ctt egg gat gat aag cag ttc 
ill £eu v2 Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

rr ac cat cct cat gca gtg cct ate acc aca gca cag ggt gag 
Phe Gin Sp Ts Pro S£ Ala Va? Pro He Thr Thr Ala Gin Gly Glu 
130 . 135 1*0 



~==, ^t-rr ana aaa ett ate ggt get cca att tac att gaa tgt agt tea 
ilu Leu S 5! ^ Se lly Ala Pro He Tyr He Glu Cys Ser Ser 
145 150 



aaa aca caa cag aat gtg aag get gtt ttt gat gca gec ate aag gta 
Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala lie Lys Val 



165 170 175 

att etc cag ccc cca aag cag aag aaa aag aag aga aag gga caa aag 
vai Leu Gin Pro Pro Lyl Gin Lys Lys Lys Lys Arg Lys Gly Gin- Lys 
180 185 I 90 



gee tgt tcc att ttg tga 
Ala Cys Ser lie Leu 
195 



<210> 20 

<211> 197 

<212> PRT 

<213> Glycine max 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



594 



<400> 20 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 *1° 15 
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Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 " 70 75 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Arg Ala Ser Tyr Glu Asn Val 
85 90 95 

Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 110 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 * 120 125 

Phe Gin Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 
130 135 140 

Glu Leu Arg Lys Leu He Gly Ala Pro He Tyr lie Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Arg Lys Gly Gin Lys 
180 185 190 



Ala Cys Ser He Leu 
195 



<210> 21 

<211> 594 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (1) . . (594) 

<223> 
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atg agt gcg tec agg ttc ate aag tgt gtc act gtg ggt gac ggt gee 48 
Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

att age aag act tgc atg etc ate tec tac acc age aac act ttt cct 
v a1 T„l Thr cvs Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

' 2V - 25 30 

acg gac tac gtg cca act gtc ttt gac aat ttc agt gca aat gtc gtt 
Thi Asd Tvr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
F 1 40 45 



35 



gtg gat gga age act gtg aat ctt ggg ttg tgg gat act get ggc caa 
Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

gaa gat tac aat aga ttg aga ccc tta age tat cgt gga gca gat gta 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 



65 



165 



gtt etc cag ccc cca aag cag aag aaa aag aag aga aag gga caa aag 
val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Arg Lys Gly Gin Lys. 
180 185 190 

gec tgt tec att ttg tga 
Ala Cys Ser lie Leu 
195 



<210> 22 

<211> 197 

<212> PRT 

<213> Glycine max 



96 



144 



192 



240 



288 



70 ^ 75 80 

ttc ctcr ctt get ttc tct etc ata age agg gee age tat gaa aat gtt 
Phe Leu Leu Ala Phe Ser Leu He Ser Arg Ala Ser Tyr Glu Asn Val 
85 9° 95 

gec aag aaa tgg att cct gag ttg agg cat tat get cct ggt gtt cca 336 
iaa lJs Lys Tr| He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 HO 

att att ctt gtt gga aca aaa ctt gat ctt egg gat gat aag cag ttc 
He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 " 120 125 

ttt caa gac cat cct ggt gca gtg cct ate acc aca gca cag ggt gag 
Phe Gin Asp His Pro Gly Ala Val Pro lie Thr Thr Ala Gin Gly Glu 
130 135 140 

gaa ctg aga aag ctt ate ggt get cca att tac att gaa tgt agt tea 
Glu Leu Arg Lyl Leu He Gly Ala Pro He Tyr He Glu Cys Ser Ser 
145 150 155 "° 

aaa aca caa cag aat gtg aag get gtt ttt gat gca gee ate aag c£a 
Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

170 175 



384 



432 



480 



528 



576 



594 



<400> 



22 
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Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 

Val Gly Lys Thr Cys Met Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Arg Ala Ser Tyr Glu Asn Val 
85 90 95 

Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 110 

He lie Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

Phe Gin Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 
130 135 140 

Glu Leu Arg Lys Leu He Gly Ala Pro lie Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Arg Lys Gly Gin Lys 
180 185 190 

Ala Cys Ser He Leu 
195 

<210> 23 

<211> 639 

<212> DNA 

<213> Glycine max 



<220> 

<221> CDS 

<222> (1) . . (639) 



WO 2005/014828 
<223> 
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a£g°ict 2 tcc g ca act gcc cca aga ttc ate aag tgc gtg acc gtt ggt 48 
Me? Ala Ser Ala Thr Ala Pro Arg Pfce He Lys Cys Val Thr Val Gly 

10 Xi> 



1 5 



£ 15 ES S if? S S Ts ss £ £ £S £ £ s £ 

- - 25 



20 



aaa ttc ccc acg gac tat ate ccc act gtg ttt gat aat ttc agt gca 
SI Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser Ala 
* 40 45 



35 



-at atd att att gaa ggc ata act gtc aac tta ggc ctt tgg gat aca 
Asn Val Val Val Glu Sly He Thr Val Asn Leu Gly Leu Trp Asp Thr 
50 55 60 

act ggg caa gag gat tac aac agg ctg agg ccc ttg age tac agg ggg 
£a Sly Sin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly 
6S 70 75 

rr^a m * crtc ttt ate ttg get ttt tct tta gtt agt cgc gcg age tat 
Sa Sp vlt Pne val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser Tyr 
85 9° 95 

■8S £ a S "1 5 s s £ s £ IS £ £ K £ 



aac ate cca ttg gtg tta gtt ggc acc aaa ttg gat eta cga gaa gac 
Ify £2 Pro Leu Val Leu Val Gly Thr Lys Leu Asp Leu Arg. Glu Asp 
115 120 125 

s £ £ sss a £ £ £ s a a s sa s s £ 

130 135 

caa aat aag gaa etc cgt aaa cac att gga get acc tac tat att gag 
Gin !!y III £u Leu Arg Lys His He Gly Ala Thr Tyr Tyr He Glu 

150 155 " 



145 



tac age tea aaa act cag cag aat gtg aag gca gtt ttt gat get get 
£ys III Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala 



165 



att aaa ata ate ate aag cct cca caa aag caa aac gag aaa aga aag 
111 Arg Me? Val lie LyI Pro Pro Gin Lys Gin Asn Glu Lys Arg Lys 
180 185 

aa* aaa cca cat aac tgt ttc eta aac gtc etc tgt cga agg aac att 
51 Lys Pro Arg G?y Cys Phe Leu Asn Val Leu Cys Arg Arg Asn He 
195 200 

gtt cgc ctt aaa tga 
Val Arg Leu Lys 
210 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



639 



<210> 
<211> 
<212> 



24 

212 

PRT 
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<400> 24 

Met Ala Ser Ala Thr Ala Pro Arg Phe He Lys Cys Val Thr Val Gly 
1 5 10 15 



Asp Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser Asn 
20 25 30 



Lys Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser Ala 
35 40 45 

Asn Val Val Val Glu Gly He Thr Val Asn Leu Gly Leu Trp Asp Thr 
50 55 60 

Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly 
65 70 75 80 

Ala Asp Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser Tyr 
85 90 • 95 



Glu Asn Val Leu Lys Lys Trp He Pro Glu Leu Gin His Phe Ala Pro 
100 ~ ** 105 110 



Gly He Pro Leu Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp 
115 120 125 

Lys His Tyr Met Ala Asp His Pro Ser Leu Val Pro Val Thr Thr Asp 
130 135 140 

Gin Gly Glu Glu Leu Arg Lys His He Gly Ala Thr Tyr Tyr He Glu 
145 * 150 155 160 

Cys Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala 
165 170 175 

He Arg Met Val He Lys Pro Pro Gin Lys Gin Asn Glu Lys Arg Lys 
180 185 190 

Lys Lys Pro Arg Gly Cys Phe Leu Asn Val Leu Cys Arg Arg Asn He 
195 200 205 



Val Arg Leu Lys 
210 



<210> 25 
<211> 762 
<212> DNA 
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288 



<220> 

<221> CDS 

<222> (1) . . (762) 

<223> 



<400> 25 t AQ 

atg gtt gga cca gga aga cct caa ate gtc etc ttc ggt tea tea ate 48 
Met val Gly Pro Gly Arg Pro Gin lie Val Leu Phe Gly Ser Ser lie 
r b 10 15 

gtt caa tac age ttc age gat ggt gga tgg gga gee act etc get aac 96' 
Val Gin Tyr Ser Phe Ser Asp Gly Gly Trp Gly Ala Thr Leu Ala Asn 
20 25 30 

ate tac tct cgc acg get gac gta ate ctt cgt ggg tat get ggt tgg 144 
He Tyr Ser Arg Thr Ala Asp Val lie Leu Arg Gly Tyr Ala Gly Trp 
35 40 45 

aac tec aga tct gee ttg aag gtg tta aac caa gtg ttc eca aag gat 192 
Asn Ser Arg Ser Ala Leu Lys Val Leu Asn Gin Val Phe Pro Lys Asp 

50 rtU 55 60 

get gtt ata caa cct tct ttg gtg ata gtc tat ttc gga ggg aat gat 240 
Ala Val He Gin Pro Ser Leu Val He Val Tyr Phe Gly Gly Asn Asp 
65 70 75 80 

tea atg cct cct cat cca tea ggg caa gga cct cat gtt cct etc tct 
Ser Met Pro Pro His Pro Ser Gly Gin Gly Pro His Val Pro Leu Ser 
85 90 95 

gaa ttc act gag aac atg agg aag ate gga gag cat ctt ttg age etc 336 
Glu Phe Thr Glu Asn Met Arg Lys He Gly Glu His Leu Leu Ser Leu 
100 105 HO 

teg gac aag acc cgt gtc att ttt etc act ccc cca cca atg aac gag 384 
Ser Asp Lys Thr Arg Val He Phe Leu Thr Pro Pro Pro Met Asn Glu 
115 ~ 120 125 

aga caa ate caa eta gtg ttt gga gat gca atg aga ggc egg agt aac 432 
Arg Gin He Gin Leu Val Phe Gly Asp Ala Met Arg Gly Arg Ser Asn 
130 135 140 

gag ctg tgt cgt cca tac gca gaa gcg ttg ttg aat eta tgc aga gag 480 
Glu Leu Cys Arg Pro Tyr Ala Glu Ala Leu Leu Asn Leu Cys Arg Glu 
145 150 155 160 

ate aat gtg aaa ggt ate gat ctt tgg aac gca ata cag caa caa gat 528 
He Asn Val Lys Gly He Asp Leu Trp Asn Ala He Gin Gin Gin Asp 
165 170 175 

gat tgg tta cac act tgc ttc act gac ggt ate cat ttc acg gee aag 576 
Asp Trp Leu His Thr Cys Phe Thr Asp Gly He His Phe Thr Ala Lys 
180 185 190 

gcg age gag att gtg gtg aag gag ata ttg aaa gta gtg aga gaa get 624 
Ala Ser Glu He Val Val Lys Glu He Leu Lys Val Val Arg Glu Ala 
195 200 205 

gat tgg aaa ccg agt ctt gac agg aag tea tta ccg gtt gag ttt cca 672 
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Asp Trp Lys Pro Ser Leu Asp Arg Lys Ser Leu Pro Val Glu Phe Pro 
210 215 220 

ttt gat tct ggt eta cca aac tec cca aga cat agt gat eta gaa tta 720 
Phe Asp Ser Gly Leu Pro Asn Ser Pro Arg His Ser Asp Leu Glu Leu 
225 230 235 240 

act aga aac aag aag ttg gag cct cgt atg gec cga ttg taa 762 
Thr Arg Asn Lys Lys Leu Glu .Pro Arg Met Ala Arg Leu 
245 250" 

<210> 26 
<211> 253 
<212> PRT 

<213> Brassica napus 



<400> 26 

Met Val Gly Pro Gly Arg Pro Gin lie Val Leu Phe Gly Ser Ser lie 
15 10 15 



Val Gin Tyr Ser Phe Ser Asp Gly Gly Trp Gly Ala Thr Leu Ala Asn 
20 25 30 



lie Tyr Ser Arg Thr Ala Asp Val lie Leu Arg Gly Tyr Ala Gly 'Trp 
35 40 45 



Asn Ser Arg Ser Ala Leu Lys Val Leu Asn Gin Val Phe Pro Lys Asp 
50 " 55 60 



Ala Val lie Gin Pro Ser Leu Val lie Val Tyr Phe Gly Gly Asn Asp 
65 70 75 80 



Ser Met Pro Pro His Pro Ser Gly Gin Gly Pro His Val Pro Leu Ser 
85 90 95 



Glu Phe Thr Glu Asn Met Arg Lys lie Gly Glu His Leu Leu Ser Leu 
100 105 110 



Ser Asp Lys Thr Arg Val He Phe Leu Thr Pro Pro Pro Met Asn Glu 
115 120 125 



Arg Gin He Gin Leu Val Phe Gly Asp Ala Met Arg Gly Arg Ser Asn 
130 135 140 



Glu Leu Cys Arg Pro Tyr Ala Glu Ala Leu Leu Asn Leu Cys Arg Glu 
145 150 155 160 



He Asn Val Lys Gly He Asp Leu Trp Asn Ala lie Gin Gin Gin Asp 
165 170 175 
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Asp Trp Leu His Thr Cys Phe Thr Asp Gly He His Phe Thr Ala Lys 
180 ~ 185 190 

Ala Ser Glu He Val Val Lys Glu He Leu Lys Val Val Arg Glu Ala 
195 200 205 

Asp Trp Lys Pro Ser Leu Asp Arg Lys Ser Leu Pro Val Glu Phe Pro 
210 215 220 

Phe Asp Ser Gly Leu Pro Asn Ser Pro Arg His Ser Asp Leu Glu Leu 
225 230 235 240 

Thr Arg Asn Lys Lys Leu Glu Pro Arg Met Ala Arg Leu 

. . 245 250 

<210> 27 

<211> 651 

<212> DNA 

<213> Brassica napus 



<220> 

<221> CDS 

<222> (1)..(261) 

<223> 



atg get tea act get tea aag ttc ata aaa tgt gtg act gtt ggt gat 48 
Met Ala Ser Thr Ala Ser Lys Phe He Lys Cys Val Thr Val Gly Asp 
1 5 10 15 

ggc gee gta ggt aaa acc tgt atg etc ate tgc tac ace age aac aaa 96 
Gly Ala Val Gly Lys Thr Cys Met Leu He. Cys Tyr Thr Ser Asn Lys 
20 25 30 

ttc cct act gac tac ata cca aca gtt ttt gac aac ttt agt gca aac 144 
Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 
35 ^ 40 45 

gtt gta gtt gaa ggc acc act gtg aac eta ggc eta tgg gac act get 192 
Val Val Val Glu Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 
50 ' 55 60 

ggg caa gaa gac tac aac aga tta agg cct tta agt tac aga gga gca 240 
Glv Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 70 75 80 

gat gtt ttc gtc ctg tct ttc tccttggtca gecgagctag ctacgagaat 291 
Asp Val Phe Val Leu Ser Phe 
85 

gtttataaaa agtggatccc tgaactccaa cactttgccc caggagttcc attagtcctt 351 
gttggtacca aactagatct ccgtgaagat aataagcatt atttggctga ccatcctgga 411 
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ctatcccctg taactactgc acagggagag gaattgcgta agctaatcgg tgcgacatat 471 
tacattgaat gtagctcgaa aactcaacag aatgtgaaag cagtttttga ttcagcgatc 531 
aaggaagtga tcaaaccggt ggttaaacaa aaggagaaga cgcagaaaac gaagaagcaa 591 
aagtctaatc atggctgttt atcaaacgtt ctgtgtggga ggatagtgac tcggcattga 651 

<210> 28 

<211> 87 

<212> PRT 

<213> Brassica napus 

<400> 28 

Met Ala Ser Thr Ala Ser Lys Phe lie Lys Cys Val Thr Val Gly Asp 
1 5 10 15 

Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser Asn Lys 
20 25 30 

Phe Pro Thr Asp Tyr lie Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 
35 40 45 

Val Val Val Glu Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 
50 55 60 

Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 70 75 80 

Asp Val Phe Val Leu Ser Phe 
85 

<210> 29 
<211> 651 
<212> DNA 

<213> Brassica napus 



<220> 

<221> CDS 

<222> (1)..(651) 

<223> 



<400> 29 

atg get tea agt get tea aag ttc ate aaa tgt gta act gtt ggt gat 



48 
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Met Ala Ser Ser Ala Ser Lys Phe He Lys Cys Val Thr Val Gly Asp 
1 5 10 15 

96 



ggt gcc gtt ggt aaa acc tgt atg etc ate tgc tat ace age aac aag 
Glv Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Ser Asn Lys 
20 25 30 

tte cct act gac tat gta cca acg gtt ttt gac aac ttt agt gca aac 144 
Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 

4i» 



35 40 



192 



240 



288 



336 



384 



432 



480 



ott gta gtt gaa gga act act gtg aac tta ggg eta tgg gat act get 
Val Val Val Glu Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 
50 55 60 

gga caa gaa gac tat aac aga tta agg cct tta age tac aga gga gca 
Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 70 75 80 

gat gtc ttc gtc ttg tct ttc tea ttg gtt age cga get age tac gag 
Isp Val Phe Val Leu Ser Phe Ser Leu Val Ser Arg Ala Ser Tyr Glu 
85 90 95 

aat gtt ttt aaa aag tgg ate cct gaa etc caa cac ttt get cca gga 
Asn Val Phe Lys Lys Trp He Pro Glu Leu Gin His Phe Ala Pro Gly 
100 105 110 

gtt cca tta gtt ctt gtc ggt acc aaa tta gat etc cgt gag gat aag 
Val Pro Leu Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys 
115 120 12 5 

cat tat ctg get gac cat cct gga eta tec cct gta act act gca cag 
His Tyr Leu Ala Asp His Pro Gly Leu Ser Pro Val Thr Thr Ala Gin 
130 " 135 140 

qga gag gag ttg cgt aag etc att ggt gca aca tat tac att gaa tgt 
Ily 111 111 Leu Arg Lys Leu He Gly Ala Thr Tyr Tyr He Glu Cys 
145 ISO 155 160 

age tea aaa act caa cag aat gtc aaa gca gtt ttt gat teg gca ate 528 
sir Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ser Ala He 
165 170 175 

aag gaa gtg ate aaa ccg gtg ctt aaa cag aag ggo aag acc aag aaa 
LyI Glu Val He Lys Pro Val Leu Lys Gin Lys Gly Lys Thr Lys Lys 

* 180 185 190 

aag aag aag caa cag teg aat cac cac ggg tgt tta tea aac gtt ttg 
Lys LyI LyI Gin Gin Ser Asn His His Gly Cys Leu Ser Asn Val Leu 

* 195 200 205 

tgt ggg agg ata gtg acc egg cat tga 
Cys Gly Arg He Val Thr Arg His 
210 " 215 

<210> 30 

<211> 216 

<212> PRT 

<213> Brassica napus 



576 



624 



651 



<400> 



30 
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Met Ala Ser Ser Ala Ser Lys Phe lie Lys Cys Val Thr Val Gly Asp 
1 5 10 15 

Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser Asn Lys 
20 25 30 

Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 
35 40 45 

Val Val Val Glu Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 
50 55 60 

Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 70 75 80 

Asp Val Phe Val Leu Ser Phe Ser Leu Val Ser Arg Ala Ser Tyr Glu 
85 90 95 

Asn Val Phe Lys Lys Trp lie Pro Glu Leu Gin His Phe Ala Pro Gly 
100 * * 105 110 

Val Pro Leu Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys 
115 120 125 

His Tyr Leu Ala Asp His Pro Gly Leu Ser Pro Val Thr Thr Ala Gin 
130 135 140 

Gly Glu Glu Leu Arg Lys Leu lie Gly Ala Thr Tyr Tyr lie Glu Cys 
145 150 155 160 

Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ser Ala lie 
165 170 175 

Lys Glu Val He Lys Pro Val Leu Lys Gin Lys Gly Lys Thr Lys Lys 
180 185 190 

Lys Lys Lys Gin Gin Ser Asn His His Gly Cys Leu Ser Asn Val Leu 
195 200 205 

Cys Gly Arg He Val Thr Arg His 
210 215 

<210> 31 

<211> 606 

<212> DNA 

<213> Brassica napus 



<220> 
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<221> CDS 

<222> (1)..(606) 

<223> 



</L(\C)> 31 

atg age aca gcg aga ttc ate aag tgt gtg acg gtc gga gat gga get 4 a 

Met Ser Thr Ala Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 
15 10 15 

gtg ggg aag act tgt atg ctg att tea tat ace age aat act ttt cct 96 
Val Gly Lys Thr Cys Met Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 

acg gat tac gtf ccg aca gtt ttt gac aat ttc agt gcg aat gtg gtg 144 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

gtc gat gga agt act gtc aac etc ggc ctg tgg gat act get ggg cag 192 
Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

gaa gat tat aac agg ctt egg cct ttg agt tac aga gga gca gat gtg 240 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

ttt tta ttg gca ttt tec eta att age aag gee agt tac gag aac att 
Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 

tac aaa aag tgg ctt ccg gag ctg aaa cat tat gcg cct age ate ccc 336 
Tyr Lys Lys Trp Leu Pro Glu Leu Lys His Tyr Ala Pro Ser He Pro 
100 1° 5 110 

att gta etc gtc gga acc aag tta gat ttg agg gat gac aaa cag ttc 384 
He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
il5 120 125 

ttg aaa gat cat cca gga gca get tea ata aca act gee cag gga gag 
Leu Lys Asp His Pro Gly Ala Ala Ser He Thr Thr Ala Gin Gly Glu 
130 135 140 

gaa tta aga aag atg att gga gee ate aag tac tta gaa tgc age tec 
Glu Leu Arg Lys Met He Gly Ala He Lys Tyr Leu Glu Cys Ser Ser 
145 150 155 I" 

aaa acc cag cag aat gtg aag gca gtg ttt gat aca gcg ate egg gta 528 
Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Thr Ala He Arg Val 
165 170 175 



288 



432 



480 



gcg ttg agg cct cca aag gca aag aag aag ata aag cca ttg agg acc 576 
Ala Leu Arg Pro Pro Lys Ala Lys Lys Lys He Lys Pro Leu Arg Thr 
180 ~ 185 190 



aaa aga tea aga aca tgc ttt ttc ttc taa 
Lys Arg Ser Arg Thr Cys Phe Phe Phe 
195 200 



<210> 32 
<211> 201 
<212> PRT 



606 
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<400> 32 

Met Ser Thr Ala Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 ^10 15 



Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 



Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 



Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 



Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 .75 80 



Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 



Tyr Lys Lys Trp Leu Pro Glu Leu Lys His Tyr Ala Pro Ser lie Pro 
100 105 110 



lie Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 12 0 125 



Leu Lys Asp His Pro Gly Ala Ala Ser He Thr Thr Ala Gin Gly Glu 
130 135 140 



Glu Leu Arg Lys Met He Gly Ala He Lys Tyr Leu Glu Cys Ser Ser 
145 ~ 150 - 155 160 



Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Thr Ala He Arg Val 
165 170 175 



Ala Leu Arg Pro Pro Lys Ala Lys Lys Lys He Lys Pro Leu Arg Thr 
180 185 190 



Lys Arg Ser Arg Thr Cys Phe Phe Phe 

200 





195 


<210> 


33 


<211> 


636 


<212> 


DNA 


<213> 


Brassica 
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<220> 

<221> CDS 

<222> (1)..(636) 

<223> 



<400> 33 

atg tea get tea gtg get get gca tea gta tea aca aca aca aca gca 
Met Ser Ala Ser Val Ala Ala Ala Ser Val Ser Thr Thr Thr Thr Ala 
15 10 15 



aac tat cca ggt get tgc aca ate tea aca gaa caa ggt caa gag eta 
Asn Tyr Pro Gly Ala Cys Thr lie Ser Thr Glu Gin Gly Gin Glu Leu 
145 150 • 155 160 

aga aaa gag ata gga gca tta gca tat ata gag tgc age tea aaa aca 
Arg Lys Glu lie Gly Ala Leu Ala Tyr lie Glu Cys Ser Ser Lys Thr 
165 170 175 



48 



get aca acg ttt ate aag tgc gtc act gtt ggc gat gga get gtg ggc 96 
Ala Thr Thr Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly 
20 25 30 

aaa act tgt ctt ctt ate tec tac acc age aac acc ttt cct act gat 144 
Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp 
35 40 45 

tat gtt cct aca gtg ttc gac aac ttc agt gca aat gtt eta gtc gat 
Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Leu Val Asp 
50 55 60 

ggc aaa acc gtc aat ctt ggt ctt tgg gat act get ggt caa gaa gat 
Gly Lys Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp 
65 70 75 80 



192 



240 



tac aat agg ctt aga cca ttg agt tac aga gga gca gat gtt ttc att 288 
Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val Phe lie 
85 90 95 

ctt gec ttt tct ctt ate age agg cct age ttt gag aac att get aaa 336 
Leu Ala Phe Ser Leu lie Ser Arg Pro Ser Phe Glu Asn lie Ala Lys 
100 105 HO ' 

aag tgg gtc cct gag ctg cga cat tat gee cct aac gtg cct att gtt 384 
Lys Trp Val Pro Glu Leu Arg His Tyr Ala Pro Asn Val Pro lie Val 
115 120 125 

eta gtg gga act aaa tta gat eta aga gag gat aag aag ttc cca atg 432 
Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys Lys Phe Pro Met 
130 - 135 140 



480 



528 



caa cag aac gtg aaa gcg gtg ttt gat gca gcg ata aaa gta gtt eta 576 
Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val Val Leu 
180 185 190 

cag cct cct aca aaa att aag aaa caa aag aga aga ttt cgt ttc tgc 624 
Gin Pro Pro Thr Lys He Lys Lys Gin Lys Arg Arg Phe Arg Phe Cys 
195 ' 200 205 



cat get etc tga 
His Ala Leu 
210 



63 6 
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<210> 34 
<211> 211 
<212> PRT 

<213> Brassica napus 
<400> 34 

Met Ser Ala Ser Val Ala Ala Ala Ser Val Ser Thr Thr Thx Thr Ala 
1 5 10 15 

Ala Thr Thr Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly 
20 25 30 

Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp 
35 40 45 

Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Leu Val Asp 
50 55 60 

Gly Lys Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp 
65 70 75 80 

Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val Phe He 
85 90 95 

Leu Ala Phe Ser Leu He Ser Arg Pro Ser Phe Glu Asn He Ala Lys 
100 105 110 

Lys Trp Val Pro Glu Leu Arg His Tyr Ala Pro Asn Val Pro lie Val 
115 120 125 

Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys Lys Phe Pro Met 
130 135 140 

Asn Tyr Pro Gly Ala Cys Thr He Ser Thr Glu Gin Gly Gin Glu Leu 
145 150 155 160 

Arg Lys Glu He Gly Ala Leu Ala Tyr He Glu Cys Ser Ser Lys Thr 
165 170 175 

Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val Val Leu 
180 185 190 

Gin Pro Pro Thr Lys He Lys Lys Gin Lys Arg Arg Phe Arg Phe Cys 
195 200 ~ 205 



His Ala Leu 
210 
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<210> 35 
<211> 588 
<212> DNA 

<213> Brassica napus 



<220> 

<221> CDS 

<222> (1)..<588) 

<223> 



<400> 35 AQ 
atg age get teg agg ttc ata aag tgt gtc acc gtc ggc gat ggt gec 48 
Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 . 15 

gtc gga aaa acc tgt atg ctg ate tct tac acg age aac acc ttc cct 96 
Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 

acg gac tat gta cca act gtt ttc gat aac ttc agt get aat gtg gtt 144 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 • 45 

gtt gat ggg aac act gtg aat ctt ggc ttg tgg gat aca get ggt caa 192 
Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

gaa gac tat aac agg tta aga cca ttg agt tac cgt ggt gcg gat gtc 240 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

ttc att ctt get ttc tct ctt ate age aaa get age tac gag aat ata 288 
Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 
85 90 95 

act aag aag tgg att cct gag etc agg cac tat gee cct ggt gtt cct 
Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 HO 

att ate etc gtt gga aca aaa etc gat ctt cga gat gac aag cag ttc 
He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

ttc ata gac cac cct ggt gca gtg ccg att agt act aac cag gga gag 432 
Phe He Asp His Pro Gly Ala Val Pro He Ser Thr Asn Gin Gly Glu 
130 135 140 

gaa eta aag aaa ctg ata ggg tct ccg get tac att gaa tgc agt tea 480 
Glu Leu Lys Lys Leu He Gly Ser Pro Ala Tyr He Glu Cys Ser Ser 

150 155 160 



336 



384 



145 

aag acg cag cag aac gtg aag gca gtc ttt gac gca gee ata aaa gta 
Lvs Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 

gtg ctt cag cca cca aag caa aag aag aag aaa aag aag aat ggt tgt 
Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Lys Lys Asn Gly Cys 
180 185 190 



528 



576 
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gtt ttc ttg tga 588 
Val Phe Leu 
195 

<210> 36 

<211> 195 

<212> PRT . 

<213> Brassica napus 



<400> 36 

Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 



Val Gly Lys Thr Cys Met Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 
20 * 25 30 



Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 



Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe lie Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn lie 
85 90 95 

Ala Lys Lys Trp lie Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 
100 105 110 



lie He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

Phe He Asp His Pro Gly Ala Val Pro He Ser Thr Asn Gin Gly Glu 
130 135 140 



Glu Leu Lys Lys Leu He Gly Ser Pro Ala Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 
165 170 175 . 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Lys Lys Asn Gly Cys 
180 185 190 



Val Phe Leu 
195 
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<210> 37 

<211> 594 

<212> DNA 

<213> Brassica napus 
<220> 

<221> CDS 

<222> (1) . . (594) 

<223> 



<400> 37 

atg age gca teg agg ttc ata aag tgt gtg acc gtt ggt gac gga get 48 
Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 
15 10 15 

gtg ggt aaa aca tgt etc etc att tct tac acc age aac act ttc cct 96 
Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 " 25 30 

acg gat tat gtt ccg act gtt ttt gat aac ttc age get aat gtg gtt 144 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

gtt aac gga gee act gtc aac ctt ggc ttg tgg gat acc get ggg cag 192 
Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

gag gat tat aac agg tta aga cca ttg agt tac cgc ggt get gat gtt 240 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 " 75 80 

ttc ate tta gee ttc tec etc ate agt aag get agt tat gag aat gtc 288 
Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 
85 90 95 

tec aag aag tgg ate cct gag ctg act cac tat gec cct ggt gtc cca 336 
Ser Lys Lys Trp He Pro Glu Leu Thr His Tyr Ala Pro Gly Val Pro 
100 105 HO 

att gtt ctt gtt ggt acc aaa eta gat ctt agg gat gac aaa cag ttc 384 
He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 

ttc gtt gac cac cct ggt get gta cct att acc act get cag gga gag 432 
Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 
130 135 140 

gaa ctg atg aag eta att gga get cct teg tac ate gag tgc agt tea 480 
Glu Leu Met Lys Leu He Gly Ala Pro Ser Tyr He Glu Cys Ser Ser 



145 



150 155 160 



aaa tea cag gag aac gtg aag ggg gtg ttt gat gca gcg att aga gtg 528 
Lys Ser Gin Glu Asn Val Lys Gly Val Phe Asp Ala Ala He Arg Val 
165 170 175 

gta ctt caa cct cca aag cag aag aaa aag aag ggc aaa gta caa aag 576 



WO 2005/014828 PCT7EP2004/008136 

39/291 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Gly Lys Val Gin Lys 
180 185 " 190 

gcc tgc tec att ttg taa 594 
Ala Cys Ser lie Leu 
195 

<210> 38 

<211> 197 

<212> PRT 

<213> Brassica napus 



• <400> 38 

Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 
15 10 15 



Val Gly Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 30 



Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 " 45 



Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 



Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 



Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 
85 90 95 



Ser Lys Lys Trp He Pro Glu Leu Thr His Tyr Ala Pro Gly Val Pro 
100 105 110 



lie Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 120 125 



Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 
130 135 140 



Glu Leu Met Lys Leu He Gly Ala Pro Ser Tyr He Glu Cys Ser Ser 
145 150 155 160 



Lys Ser Gin Glu Asn Val Lys Gly Val Phe Asp Ala Ala He Arg Val 
165 170 175 



Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Gly Lys Val Gin Lys 
180 185 - 190 
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Ala Cys Ser lie Leu 
195 

<210> 39 

<211> 645 

<212> DNA 

<213> Oryza sativa 



<220> 

<221> CDS 

<222> (1) . . (645) 

<223> 



<400> 39 AQ 
atg age teg gcg gcg gcg gcg acg agg ttc ate aag tgc gtc ace gtg 4B 
Met Ser Ser Ala Ala Ala Ala Thr Arg Phe lie Lys Cys Val Thr Val 
x 5 10 15 

ggg gac ggc gcg gtg ggg aag acg tgc atg etc ate tgc tac acc tgc 96 
Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Cys 
20 25 30 

aac aag ttc ccc acc gat tac ate ccc acc gtg ttc gac aac ttc age 144 
Asn Lys Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser 
35 40 45 

gee aat gtc tec gtg gac ggg age gtc gtc aac etc ggc etc tgg gac 192 
Ala Asn Val Ser Val Asp Gly Ser Val Val Asn Leu Gly Leu Trp Asp 
50 55 60 

act gca ggt cag gag gat tac age agg ttg agg cct ctg age tac agg 240 
Thr Ala Gly Gin Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg 
65 70 75 80 

gga gec gat gtg ttc ate ctg tec ttc tec ctg ata age agg gcg age 288 
Gly Ala Asp Val Phe He Leu Ser Phe Ser Leu He Ser Arg Ala Ser 
85 90 95 

tat gag aat gtt cag aag aag tgg atg cca gag ctt cgc egg ttt gcg 336 
Tyr Glu Asn Val Gin Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala 
100 105 HO 

cct ggt gtt cct gta gtt ctt gtt gga acc aag ttg gat etc cgt gaa 384 
Pro Gly Val Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu 
115 120 125 

gat agg gee tat ctt get gat cat cca get tct tec ata ata aca acg 432 
Asp Arg Ala Tyr Leu Ala Asp His Pro Ala Ser Ser He He Thr Thr 
130 135 140 



gag cag gga gaa gaa ctg agg aag eta ata gga gcg gtc gee tac ate 
Glu Gin Gly Glu Glu Leu Arg Lys Leu He Gly Ala Val Ala Tyr He 
145 150 ~ 155 160 

gaa tgc age tec aag aca cag aga aac att aaa get gtt ttc gac act 
Glu Cys Ser Ser Lys Thr Gin Arg Asn He Lys Ala Val Phe Asp Thr 
165 170 175 



480 



528 



WO 2005/014828 
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gcc ate aaa gtg gtg ctt caa cct cca aga cat aag gat gta acc aga 576 
Ala lie Lys Val Val Leu Gin Pro Pro Arg His Lys Asp Val Thr Arg 
180 185 190 

aag aaa etc caa tea age tec aat egg cca gta agg egg tac ttt tgc 624 
Lys Lys Leu Gin Ser Ser Ser Asn Arg Pro Val Arg Arg Tyr Phe Cys 
195 200 205 

gga age get tgt ttc gcg tag 645 
Gly Ser Ala Cys Phe Ala 
210 

<210> 40 

<211> 214 

<212> PRT 

<213> Oryza sativa 



<400> 40 

Met Ser Ser Ala Ala Ala Ala Thr Arg Phe lie Lys Cys Val Thr Val 
15 10 15 



Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Cys 
20 25 30 



Asn Lys Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser 
35 40 45 



Ala Asn Val Ser Val Asp Gly Ser Val Val Asn Leu Gly Leu Trp Asp 
50 55 60 



Thr Ala Gly Gin Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg 
65 70 75 80 



Gly Ala Asp Val Phe He Leu Ser Phe Ser Leu He Ser Arg Ala Ser 
85 90 95 



Tyr Glu Asn Val Gin Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala 
100 " 105 . 110 



Pro Gly Val Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu 
115 120 125 



Asp Arg Ala Tyr Leu Ala Asp His Pro Ala Ser Ser He He Thr Thr 
130 135 140 



Glu Gin Gly Glu Glu Leu Arg Lys Leu He Gly Ala Val Ala Tyr He 
145 ^ 150 155 160 



Glu Cys Ser Ser Lys Thr Gin Arg Asn He Lys Ala Val Phe Asp Thr 
165 170 175 



WO 2005/014828 



42/291 



PCT/EP2004/008136 



Ala He Lys Val Val Leu Gin Pro Pro Arg His Lys Asp Val Thr Arg 
180 185 190 

Lys Lys Leu Gin Ser Ser Ser Asn Arg Pro Val .Arg Arg Tyr Phe Cys 
195 200 205 



Gly Ser Ala Cys Phe Ala 
210 



<210> 41 

<211> 645 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> CDS 

<222> (1)..(645) 

<223> 



<400> 41 

atg age ggc gec acc aag ttc ate aag tgc gtc acc gtc ggc gac ggc 
Met Ser Gly Ala Thr Lys Phe He Lys Cys Val Thr. Val Gly Asp Gly 
15 10 15 

gec gtc ggc aag acc tgc atg etc ate tgc tac ate age aac aag ttc 
Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr He Ser Asn Lys Phe 
20 25 30 

ccc acc gat tac ate ccc acc gtg ttc gac aac ttc agt get aat gtt 144 
Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe "Ser Ala Asn Val 
35 40 45 

tea gtg gat ggg aac ate gtc aac ttg gga tta tgg gac act get gga 192 
Ser Val Asp Gly Asn He Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 
50 ** * 55 60 

caa gag gat tac age agg ctg agg cca ctg age tac agg gga gcg gat 240 
Gin Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp 
65 70 75 80 

ata ttt gtg ctg gca ttc tea ctg ate age aga gca age tat gag aat 
He Phe Val Leu Ala Phe Ser Leu He Ser Arg Ala Ser Tyr Glu Asn 
85 90 95 

gtt etc aag aag tgg atg ccg gag ctt cgc egg ttc gca ccg aat gtt 
Val Leu Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala Pro Asn Val 
100 105 HO 

cca att gtt ctt gtt ggg acc aag tta gat eta cgt gac cac aga tct 384 
Pro He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp His Arg Ser 
115 120 125 

tac ctt gcg gac cat cct get get tec gca att acg act gca cag ggt 432 



'48 



96 



288 



336 



WO 2005/014828 PCT/EP2004/008136 

43/291 

Tyr Leu Ala Asp His Pro Ala Ala Ser Ala lie Thr Thr Ala Gin Gly 

130 135 140 

gaa gaa ctt aga aag cag ata ggc gcc gcg get tac ate gaa tgc agt 480 
Glu Glu Leu Arg Lys Gin lie Gly Ala Ala Ala Tyr lie Glu Cys Ser 
145 150 155 160 

teg aaa aca caa cag aac ate aag gcc gtg ttt gat act gcc ate aag 528 
Ser Lys Thr Gin Gin Asn lie Lys Ala Val Phe Asp Thr Ala lie Lys 
165 170 ' 175 

gtg gtc ctt cag cct cct egg aga agg ggg gag acg acg atg gca agg 576 
Val Val Leu Gin Pro Pro Arg Arg Arg Gly Glu Thr Thr Met Ala Arg 
180 185 190 

aag aag aca agg cga age ace ggc tgc teg tta aag aac ttg atg tgt 624 
Lys Lys Thr Arg Arg Ser Thr Gly Cys Ser Leu Lys Asn Leu Met Cys 
195 200 205 . 

ggc agt gca tgt gtt gtt tag 645 
Gly Ser Ala Cys Val Val 
210 

<210> 42 

<211> 214 

<212> PRT 

<213> Oryza sativa 



<400> 42 

Met Ser Gly Ala Thr Lys Phe lie Lys Cys Val Thr Val Gly Asp Gly 
1 5 10 1*5 



Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr lie Ser Ash Lys Phe 
20 25 30 



Pro Thr Asp Tyr lie Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val 
35 40 45 



Ser Val Asp Gly Asn He Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 
50 55 . 60 



Gin Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp 
65 70 75 80 



He Phe Val Leu Ala Phe Ser Leu He Ser Arg Ala Ser Tyr Glu Asn 
85 90 95 

Val Leu Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala Pro Asn Val 
100 ~ 105 110 



Pro lie Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp His Arg Ser 
115 120 125 
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Tyr Leu Ala Asp His Pro Ala Ala Ser Ala He Thr Thr Ala Gin Gly 
130 135 140 

Glu Glu Leu Arg Lys Gin He Gly Ala Ala Ala Tyr He Glu Cys Ser 
145 150 155 160 

Ser Lys Thr Gin Gin Asn He Lys Ala Val Phe Asp Thr Ala He Lys 
165 170 175 

Val Val Leu Gin Pro Pro Arg Arg Arg Gly Glu Thr Thr Met Ala Arg 
180 185 190 

Lys Lys Thr Arg Arg Ser Thr Gly Cys Ser Leu Lys Asn Leu Met Cys 
- 195 - 200 205 

Gly Ser Ala Cys Val Val 
210 

<210> 43 

<211> 639 

<212> DNA 

<213> Oryza sativa 



<220> 

<221> CDS 

<222> (1) . . (639) 

<223> 



<400> 43 

atg ggc teg aag ccg ccg ccg ccg ccg cag ccc age gtc tec ttc aag 
Met Gly Ser Lys Pro Pro Pro Pro Pro Gin Pro Ser Val Ser Phe Lys 
15 10 15 



etc tct ate tgg gat aca get gga caa gag aag ttc cat gca eta ggc 
•Leu Ser He Trp Asp Thr Ala Gly Gin Glu Lys Phe His Ala Leu Gly 
65 ~ 70 75 80 

cct ata tac tat cgt gat gca gac get get ctt tta gta tat gac ate 
Pro He Tyr Tyr Arg Asp Ala Asp Ala Ala Leu Leu Val Tyr Asp He 
85 " 90 95 



48 



etc gtc etc etc ggc gac ggg aga gta ggc aaa aca tct ctt gtg ctg 96 
Leu Val Leu Leu Gly Asp Gly Arg Val Gly Lys Thr Ser Leu Val Leu 
20 * 25 30 

egg tat gtg aat gat gtc ttc tea gac aaa cag gaa gca act gtt caa 144 
Arg Tyr Val Asn Asp Val Phe Ser Asp Lys Gin Glu Ala Thr Val Gin 
35 ~ 40 45 

get tea tat ttg aca aag cgc ctt gtt gtt gaa ggt gtg cct att acg 192 
Ala Ser Tyr Leu Thr Lys Arg Leu Val Val Glu Gly Val Pro He Thr 
50 55 60 



240 



288 
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aca gac aat gat act ttt ctt cgt gtc aca aag tgg gtg aaa gag ctt 336 
Thr Asp Asn Asp Thr Phe Leu Arg Val Thr Lys Trp Val Lys Glu lieu 
100 105 110 

aag caa atg gca aat aaa gat att gtt atg gcc att gca gca aat aaa 384 
Lys Gin Met Ala Asn Lys Asp He Val Met Ala He Ala Ala Asn Lys 
115 120 125 

agt gac ctg gtt aga tea aaa cac ata gac act aat gaa gca gca age 432 
Ser Asp Leu Val Arg Ser Lys His He Asp Thr Asn Glu Ala Ala Ser 
130 135 140 

tat gca gaa agt att ggg gca act etc ttt gtt act tct gcc aaa get 480 
Tyr Ala Glu Ser He Gly Ala Thr Leu Phe Val Thr Ser Ala Lys Ala 
145 150 155 160 

ggt act ggg att gat gat -ate ttc agt gac ata gcc aag aga tta tta 528 
Gly Thr Gly lie Asp Asp He Phe Ser Asp He Ala Lys Arg Leu Leu 
165 170 175 

gag aga aga aaa aac age tec gat ggc ttg tea ctg get cat cca aag 576 
Glu Arg Arg Lys Asn Ser Ser Asp Gly Leu Ser Leu Ala His Pro Lys 
180 185 190 

aaa ggc att tta ate gtc gac gat gaa cct gag aaa gaa cct cca cca 624 
Lys Gly He Leu He Val Asp Asp Glu Pro Gin Lys Glu Pro Pro Pro 
195 200 205 

aag tgc tgc tea tag 639 
Lys Cys Cys Ser 
210 

<210> 44 

<211> 212 

<212> PRT 

<213> Oryza sativa 



<400> 44 

Met Gly Ser Lys Pro Pro Pro Pro Pro Gin Pro Ser Val Ser Phe Lys 
15 10 15 

Leu Val Leu Leu Gly Asp Gly Arg Val Gly Lys Thr Ser Leu Val Leu 
20 25 30 

Arg Tyr Val Asn Asp Val Phe Ser Asp Lys Gin Glu Ala Thr Val Gin 
35 40 45 

Ala Ser Tyr Leu Thr Lys Arg Leu Val Val Glu Gly Val Pro He Thr 
50 55 60 

Leu Ser He Trp Asp Thr Ala Gly Gin Glu Lys Phe His Ala Leu Gly 
65 70 75 80 



Pro He Tyr Tyr Arg Asp Ala Asp Ala Ala Leu Leu Val Tyr Asp He 
85 90 95 
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Thr Asp Asn Asp Thr Phe Leu Arg Val Thr Lys Trp Val Lys Glu Leu 
100 105 HO 

Lvs Gin Met Ala Asn Lys Asp lie Val Met Ala He Ala Ala Asn Lys 
Y H5 120 125 

Ser Asp Leu Val Arg Ser Lys His He Asp Thr Asn Glu Ala Ala Ser 
130 135 140 

Tyr Ala Glu Ser He Gly Ala Thr Leu Phe Val Thr Ser Ala Lys Ala 
145 150 155 160 

Gly Thr Gly He Asp Asp He Phe Ser Asp He Ala Lys Arg Leu Leu 
165 170 175 

Glu Arg Arg Lys Asn Ser Ser Asp Gly Leu Ser Leu Ala His Pro Lys 
180 185 190 

Lvs Gly He Leu He Val Asp Asp Glu Pro Glu Lys Glu Pro Pro Pro 
195 200 205 



Lys Cys Cys Ser 
210 



<210> 45 

<211> 806 

<212> DNA 

<213> Brassica napus 

<220> 

<221> CDS 

<222> (6).. (596) 

<223> 



ggaaa atg age gca tct egg ttc ata aag tgc gtg acg gtc ggt gac gga 50 
Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly 
15 10 15 

gca gtg ggc aaa aca tgt etc etc ate tct tac ace age aac act ttc 98 
Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe 
20 A 25 30 

cct acg gat tat gtg ccc act gtt ttc gat aac ttt age get aac gtt 146 
Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val 
35 40 45 



gtt gtt aac gga gee act gtc aac tta gga etc tgg gat aca gca ggg 



194 
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Val Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 
50 55 60 

cag gag gat tat aac aga ttg aga ccc ttg agt tac cgc ggt get gac 242 
Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp 
65 70 75 

gtt ttc ate tta gec ttc tct ctt ate agt aag get agt tat gag aat 290 
Val Phe lie Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn 
80 85 90 95 

gtc tec .aag aag tgg ate cct gag ctg ace cac tat gee cct ggt gtc 338 
Val Ser Lys Lys Trp lie Pro Glu Leu Thr His Tyr Ala Pro Gly Val 
100 105 110 

cct att gtt ctt gtt gga ace aaa eta gat ctt agg gat gac aaa cag 38 6 

Pro lie Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin 
115 ~ 120 " 125 



ttc ttc gtt gac cac cct ggt get gta cct att acc act get cag ggc 434 
Phe Phe Val Asp His Pro Gly Ala Val Pro lie Thr Thr Ala Gin Gly 
130 135 140 

gag gaa eta atg aag eta att gga get cct teg tac ate gag tgc age 482 
Glu Glu Leu Met Lys Leu lie Gly Ala Pro Ser Tyr lie Glu Cys Ser 
145 150 155 

tea aaa tea cag gag aac gtg aag ggg gtg ttt gat gca gcg ate aga 53 0 

Ser Lys Ser Gin Glu Asn Val Lys Gly Val Phe Asp Ala Ala lie Arg 
160 165 170 175 

gtg gta ctt caa cct cca aag cag aag aaa aag aag age aaa get caa 578 
Val Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Gin 
180 .185 190 

aag gee tgc tec att ttg tgatttctct acgctcatct ctcttccact 626 
Lys Ala Cys Ser lie Leu 
195 

ctctagtgaa ggcttaagaa gaagaaacac tttagccttt aagatttggt tcagagttcg 686 

ttgtgataag cctcgcttaa tccttagaaa cgattacttc tggttttact gataaagagc 746 

ttccagcttt atagctgaga gctaatattg cactaccact ataaaaaaaa aaaaaaaaaa 806 

<210> 46 

<211> 197 

<212> PRT 

<213> Brassica napus 



<400> 46 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 



Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 
20 25 8 30 



Thr Asp Tyr 
35 



Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
40 45 



WO 2005/014828 



48/291 



PCT/EP2004/008136 



val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 
50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr -Arg Gly Ala Asp Val 
65 70 75 80 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 
85 9° 95 

Ser Lys Lys Trp He Pro Glu Leu Thr His Tyr Ala Pro Gly Val Pro 
100 105 HO 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 
115 " 120 - 125 

Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 
130 135 140 

Glu Leu Met Lys Leu He Gly Ala Pro Ser Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Ser Gin Glu Asn Val Lys Gly Val Phe Asp Ala Ala He Arg val 
165 I 70 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Gin Lys 
180 185 190 

Ala Cys Ser He Leu 
195 

<210> 47 

<211> 32 

<212> PRT 

<213> consensus 

<220> 

<221> MI S COFEATURE 
<222> (1)..(32) 

<223> Xaa in position 15 is Leu, He or Met; Xaa in position 16 is Leu 
or val; Xaa in position 17 is He, Leu or Tyr; Xaa in position 18 

is Ser, Cys, Val or Arg; Xaa in position 20 is Thr, He or Val; 
Xaa in position 23 is Thr, Lys, Gin or Val 



<400> 47 

Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Xaa Xaa 
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1 5 10 15 

Xaa Xaa Tyr Xaa Ser Asn Xaa Phe Pro Thr Asp Tyr Val Pro Thr Val 
20 25 30 

<210> 48 
<211> 32 
<212> PRT 
<213> consensus 



<220> 

<221> MIS C_FEATURE 

<222> (1)..(32) 

<223> Xaa any natural amino acid 

<400> 48 

Lys Xaa Val Xaa Xaa Gly Asp Xaa Xaa Xaa Gly Lys Xaa Xaa Xaa Xaa 

1 5 10 15 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Thr Val 
20 25 30 



<210> 49 

<211> 16 " 

<212> PRT 

<213> consensus 

<220> 

<221> MIS COFEATURE 

<222> (1)..(16) 
<223> 

<400> 49 

Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu 

1 5 10 15 

<210> 50 



<211> 16 



WO 2005/014828 
<212> PRT 
<213> consensus 
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<220> 

<221> MISC_FEATURE 
<222> (1).. (16) 

<223> Xaa in position 1 is Leu or He; Xaa in position 9 is Asp, Lys 
Glu; Xaa in position 10 is Tyr or Phe; Xaa in position 11 is A 
, ser, Ala, His or Glu; Xaa in position 12 is Arg or Ala; Xaa i 
position 14 is Arg, Ala or Gly; Xaa in position 16 is Leu, He 
Phe 



<400> 50 

Xaa Trp Asp Thr Ala Gly Gin Glu Xaa Xaa Xaa Xaa Leu Xaa Pro Xaa 

<210> 51 

<211> 7 

<212> PRT 

<213> consensus 



<400> 51 

Gly Thr Lys Leu Asp Leu Arg 
1 5 

<210> 52 

<211> 7 

<212> PRT 

<213> consensus 



<220> 

< 2 2 1 > MI SCJFEATURE 
<222> (1)..(7) 



<223> Xaa in postion 1 is Gly or Ala, Xaa in postion 2 is Thr Asn or L 
eu, Xaa in postion 4 is Leu, Ala, Ser or Lys, Xaa in postion 7 is 



Arg, His or Val 



<400> 52 

Xaa Xaa Lys Xaa Asp Leu Xaa 
1 * 5 
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<210> 53 

<211> 24 

<212> DNA 

<213> primer 



<400> 53 

atgtctgaaa aggccgttag aagg 24 

<210> 54 

<211> 26 

<212> DNA 

<213> primer 



<400> 54 

ttataaaatt atgcaacagt tagccc 26 

<210> 55 
<211> 579 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 

<221> CDS 

<222> (1)..(579) 

<400> 55 

atg tct gaa aag gcc gtt aga agg aaa ctt gtt att att ggt gat ggt 48 

Met Ser Glu Lys Ala Val Arg Arg Lys Leu Val lie lie Gly Asp Gly 

1 5 10 15 

get tgt ggc aag acc tct tta eta tat gta ttt aca tta gga aaa ttc 96 

Ala Cys Gly Lys Thr Ser Leu Leu Tyr Val Phe Thr Leu Gly Lys Phe 

20 25 30 

cct gaa caa tat cat ccg aca gtg ttc gag aat tat gtc act gat.tgc 144 
Pro Glu Gin Tyr His Pro Thr Val Phe Glu Asn Tyr Val Thr Asp Cys 

35 40 45 

aga gtt gac gga ata aaa gtg tec tta acg etc tgg gat aca gcg gga 192 
Arg Val Asp Gly lie Lys Val Ser Leu Thr Leu Trp Asp Thr Ala Gly 

50 * 55 60 

caa gag gaa tat gaa cgt tta cgt cca ttc tea tat tea aaa gca gat 240 
Gin Glu Glu Tyr Glu Arg Leu Arg Pro Phe Ser Tyr Ser Lys Ala Asp 
65 70 75 80 

ata ata tta att ggg ttt get gta gac aat ttt gaa tea eta att aac 288 
He He Leu He Gly Phe Ala Val Asp Asn Phe Glu Ser Leu He Asn 

85 90 95 

gca agg acg aaa tgg gcg gat gag gca tta cga tat tgt cct gac gca 336 
Ala Arg Thr Lys Trp Ala Asp Glu Ala Leu Arg Tyr Cys Pro Asp Ala 

100 105 HO 

cca ate gtt ctt gta ggc ttg aaa aaa gat ttg agg caa. gaa gcc cat 384 
Pro He Val Leu Val Gly Leu Lys Lys Asp Leu Arg Gin Glu Ala His 

115 * 120 125 

ttt aaa gag aat get acc gat gaa atg gtt ccc att gaa gat gca aaa 432 
Phe Lys Glu Asn Ala Thr Asp Glu Met Val Pro lie Glu Asp Ala Lys 

130 135 140 

caa gtt gca agg gcc att ggg gcc aag aaa tac atg gaa tgt agt gca 480 



WO 2005/014828 

Gin Val Ala Arg Ala He 
145 150 
ctg act ggt gag ggt gtg 
Leu Thr Gly Glu Gly Val 
165 

agt ttg ctt atg aag aag 
Ser Leu Leu Met Lys Lys 
180 

taa 



52/291 

Gly Ala Lys Lys Tyr Met 
155 

gat gat gtc ttt gaa gta 
Asp Asp Val Phe Glu Val 
170 

gaa cca ggg get aac tgt 
Glu Pro Gly Ala Asn Cys 
185 



PCT/EP2004/008136 

Glu Cys Ser Ala 
160 

get aca aga acc 528 
Ala Thr Arg Thr 
175 

tgc ata att tta 576 
Cys He He Leu 
190 

579 



<210> 56 
<211> 192 
<212> PRT 

<213> Saccharomyces cerevisiae 

Met°Ser 6 Qlu Lys Ala Val Arg Arg Lys Leu Val He He Gly Asp Gly 

1 5 10 I 5 

Ala Cys Gly Lys Thr Ser Leu Leu Tyr Val Phe Thr Leu Gly Lys Phe 

20 25 30 

Pro Glu Gin Tyr His Pro Thr Val Phe Glu Asn Tyr Val Thr Asp Cys 

35 40 45 

Arg Val Asp Gly He Lys Val Ser Leu Thr Leu Trp Asp Thr Ala Gly 

50 ~ 55 60 

Gin Glu Glu Tyr Glu Arg Leu Arg Pro Phe Ser Tyr Ser Lys Ala Asp 
65 70 75 80 

He He Leu He Gly Phe Ala Val Asp Asn Phe Glu Ser Leu He Asn 

85 90 * 5 *i 

Ala Arg Thr Lys Trp Ala Asp Glu Ala Leu Arg Tyr Cys Pro Asp Ala 

100 105 HO 

Pro He Val Leu Val Gly Leu Lys Lys Asp Leu Arg Gin Glu Ala His 

115 120 125 

Phe Lys Glu Asn Ala Thr Asp Glu Met Val Pro He Glu Asp Ala Lys 

130 135 140 

Gin Val Ala Arg Ala He Gly Ala Lys Lys Tyr Met Glu Cys Ser Ala 

145 150 155 160 

Leu Thr Gly Glu Gly Val Asp Asp Val Phe Glu Val Ala Thr Arg Thr 

165 170 175 

Ser Leu Leu Met Lys Lys Glu Pro Gly Ala Asn Cys Cys lie He Leu 

180 185 190 

<210> 57 
<211> 803 
<212> DNA 

<213> Nicotiana tabacum 

<220> 

<221> CDS 

<222> (49) . . (642) 



57 



aaaaaaalaa aaaaaaaaaa aaaaaaaaaa aaaagaaaaa agaaaaaa atg agt get 

Met Ser Ala 

i 

tea agg ttt ate aag tgt gtt acg gtt ggt gat ggt get gtt ggg aaa 105 
Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys 

5 " 10 IS 

act tgt ctt ttg att tea tac acc age aat act ttc cct acg gat tat 
Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp Tyr 
20 25 30 35 

crta ccc act gtg ttt gac aat ttc agt gca aat gtt gtt gtc aat ggg 
Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val Val Asn Gly 

40 45 50 

age act gtc aac eta ggg ttg tgg gat act get gga cag gag gat tac 
Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 
55 60 65 



153 
201 
249 
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aat 


&gg 


tta 


aga 


cct 


ctg 


agt 


tac 


cgt 


gga 


gec 


gat 


gtt 


ttc 


att 


ttg 


297 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


lie 


Leu 








70 










75 










80 










gca 


ttc 


tct 


etc 


att 


agt 


aaa 


gec 


age 


tat 


gag 


aat 


gtt 


tec 


aag 


aag 


345 


Ala 


Phe 


Ser 


Leu 


lie 


Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


Val 


Ser 


Lys 


Lys 






85 










90 










95 












taa 

**3 3 


att 


cct 


gag 


ttg 


aag 


cac 


tat 


get 


cct 


ggt 


gtc 


cca 


at a 


att 


ctt 


393 


Trp 


lie 


Pro 


Glu 


Leu 


Lvs 


His 


Tyr 


Ala 


Pro 


Gly 


val 


Pro 


lie 


Val 


Leu 




100 










105 










110 










115 




att 

23 w w 


aaa 


aca 


aaa 


ctt 


aat 


ctt 


caa 


aat 


gat 


aaa 


caa 


ttc 


ttc 


ata 


aac 


441 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Ara 


Asp 


Asp 


Lys 


Gin 


Phe 


Phe 


lie 


Asp 












120 










125 










130 






cat 


ccc 


aat 

3 3 


act 

^ -w H. 


gta 

3 w 3 


cca 


att 


act 


act 


get 


cag 


ggc 

3 3** 


aaa 


aaa 

3**3 


eta 


aaa 

3 Zf 


489 


His 


Pro 


Glv 


Ala 


Val 


Pro 


lie 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 


Ara 










135 










140 










145 








aaa 


act 


att 


aat 


aca 


cct 


act 


tac 


att 


gaa 


tgt 


aat 


tea 


aaa 


aca 


caa 


537 




Thr 


lie 


Gly 


Ala 


Pro 


Ala 


Tyr 


lie 


Glu 


Cys 


Ser 


Ser 


Lvs 


Thr 


Gin 








150 










155 










160 










cag 


aat 


gtg 


aaa 


gca 


gtc 


ttt 


gat 


get 


gec 


att 


aag 


gtc 


gtc 


etc 


cag ■ 


585 


Gin 


Asn 


Val 


Lys 


Ala 


Val 


Phe 


Asp 


Ala 


Ala 


lie 


Lys 


Val 


Val 


Leu 


Gin 






165 










170 










175 












ccg 


cct 


aag 


acg 


aag 


aaa 


aag 


aag 


ggg 


aaa 


tct 


caa 


aaa 


tec 


tgc 


teg 


633 


Pro 


Pro 


Lys 


Thr 


Lys 


Lys 


Lys 


Lys 


Gly 


Lys 


Ser 


Gin 


Lys 


Ser 


Cys 


Ser 




180 










185 










190 










195 





ata ttg tgatcaaact gtaggtccaa egatagtget ggtcacaatt ctcgttgtaa 689 
lie Leu 

tccctcactc actcgtatcc ctttatcttc cttcgtctca gaggaagtac gaggtcatcg 749 

tttgtcttat atttgaaact' actaaccatc tcttggaaat atcaggtggc ttgg 803 



<210> 58 
<211> 197 
<212> PRT 

<213> Nicotiana tabacum 



<400> 58 



Met 


Ser 


Ala 


Ser Arg 


Phe 


He 


Lys 


1 








5 








Val 


Gly 


Lys 


Thr 
20 


Cys 


Leu 


Leu 


He 


Thr 


Asp 


Tyr 
35 


Val 


Pro 


Thr 


Val 


Phe 
40 


Val 


Asn 
50 


Gly 


Ser 


Thr 


Val 


Asn 
55 


Leu 


Glu 


Asp 


Tyr 


Asn Arg 


Leu 


Arg 


Pro 


65 










70 






Phe 


He 


Leu 


Ala 


Phe 
85 


Ser 


Leu 


He 


Ser 


Lys 


Lys 


Trp 
100 


He 


Pro 


Glu 


Leu 


He 


val 


Leu 


Val Gly 


Thr 


Lys 


Leu 






115 










120 


Phe 


He 
130 


Asp 


His 


Pro 


Gly 


Ala 
135 


Val 


Glu 


Leu 


Arg 


Lys 


Thr 


He 


Gly Ala 


145 










150 






Lys 


Thr 


Gin 


Gin 


Asn 
165 


Val 


Lys 


Ala 


Val 


Leu 


Gin 


Pro 


Pro 


Lys 


Thr Lys 








180 










Ser 


Cys 


Ser 
195 


He 


Leu 









Cys 


Val 


Thr 


Val 


Gly 


Asp 


Gly 


Ala 




10 










15 




Ser 


Tyr 


Thr 


Ser 


Asn 


Thr 


Phe 


Pro 


25 










30 






Asp 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


Val 










45 








Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 








60 










Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 






75 










80 


Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


Val 




90 ' 










95 




Lys 


His 


Tyr 


Ala 


Pro 


Gly 


Val 


Pro 


105 










110 






Asp 


Leu 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 










125 








Pro 


He 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 








140 










Pro 


Ala 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 






155 










160 


Val 


Phe 


Asp 


Ala 


Ala 


He 


Lys 


Val 




170 










175 




Lys 


Lys 


Lys 


Gly 


Lys 


Ser 


Glh 


Lys 



185 190 



196 



244 
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<210> 59 
<211> 993 
<212> DNA 

<213> Nicotiana tabacum 

<220> 

<221> CDS 

<222> (29) ... (625) 

<400> 59 

ggcacgaggg agatttgggt tttgtaga atg agt gcg tct agg ttc ata aag 52 

Met Ser Ala Ser Arg Phe lie Lys 
1 5 

tgt gtc acc gtt ggc gac gga get gtg ggt aaa act tgt ctt etc att 100 
Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He 

10 15 20 

tec tat acc age aac aca ttt ccc act gat tac gtc cca act gta ttc 148 
- Ser-Tyr Thr Ser-Asn Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe 
25 30 35 40 

gac aat ttt agt gca aat gtg gtt gtc gat ggg age act gtc aat ctg 
Asp Asn Phe Ser Ala Asn Val Val Val Asp Gly Ser Thr Val Asn Leu 

45 50 55 

ggg ctg tgg gat act gca ggt cag gag gat tac aat aga tta aga ccg 
Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro 

60 65 70 

ttg age tac egg ggg gca gat gta ttt ata ctg gca ttt tct etc att 292 
Leu Ser Tyr Arg Gly Ala Asp Val Phe He Leu Ala Phe Ser Leu He 

75 ~ 80 85 

age aaa gcg age tat gaa aat gtc tec aaa aag tgg att cct gaa ttg 340 
Ser Lys Ala Ser Tyr Glu Asn Val Ser Lys Lys Trp He Pro Glu Leu 

90 95 100 

agg cat tat get cct gga gtt cca att att ctt gtt gga aca aag eta 388 
Arg His Tyr Ala Pro Gly Val Pro He He Leu Val Gly Thr Lys- Leu 
105 110 H5 120 

gat etc cga gag gat aag caa ttc ttc ctg gac cat cca ggt get gtt 436 
Asp Leu Arg Glu' Asp Lys Gin Phe Phe Leu Asp His Pro Gly Ala Val 

125 130 135 

cca ctt acc aca get cag ggt gaa gag ctg aga aag teg att ggt get 484 
Pro Leu Thr Thr Ala Gin Gly Glu Glu Leu Arg Lys Ser He Gly Ala 

140 145 150 

tec get tac att gaa tgt agt gca aaa aca caa cag aat gtg aag get 532 
Ser Ala Tyr He Glu Cys Ser Ala Lys Thr Gin Gin Asn Val Lys Ala 

155 ^ 160 165 

gtt ttt gat get gee att aag gtg gtt eta caa cca ccc aaa caa aag 580 
Val Phe Asp Ala Ala He Lys Val Val Leu Gin Pro Pro Lys Gin Lys 

170 175 180 

aag aaa aag aag aga aag ggt caa aaa gec tgc tct ate ttg tgattgctga 632 
Lys Lys Lys Lys Arg Lys Gly Gin Lys Ala Cys Ser He Leu 
185 ~ 190 195 

aataatagac aagtgatgga gatgtagatt gttatcaatg tcttccaagt tcaaagaatg 692 
cagtgtaagg ttcaacgttg gtagtcctga ctgactatga taggaaagca tgaatctgee 752 
ttgtccgtaa cattggaggc caagatgtat atttgtgatc cgcatatggt tggggataca 812 
gatgtgcaaa attctctgtt tegegttgat tctgtgtaat atattatgta acacttgtgt 872 
gatgattcct tgaatttgea tctactatgt gttgttaaaa tgtaaccgaa catactgatt 932 
tagattgtcc aagaagttga atttgetatg tcataaaaaa aaaaaaaaaa aaaaaaaaaa 992 



a 



993 
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<210> 60 
<211> 198 
<212> PRT 

<213 > Nicotiana tabacum 
<400> 60 



Met 


Ser 


Ala 


Ser Arg 


Phe He 


Lys Cys Val Thr Val 


Gly Asp Gly 


Ala 


1 






5 




10 


15 




Val 


Gly 


Lys 


Thr Cys 


Leu Leu 


He Ser Tyr Thr Ser 


Asn Thr Phe 


Pro 






20 




25 


30 




Thr 


Asp 


Tyr 


Val Pro 


Thr Val 


Phe Asp Asn Phe Ser 


Ala Asn Val 


Val 






35 






40 


45 




Val 


Asp 


Gly 


Ser Thr 


Val Asn 


Leu Gly Leu Trp Asp 


Thr Ala Gly 


Gin 




50 




- - 


55 


60 


- .- 




Glu 


Asp 


Tyr 


Asn Arg 


Leu Arg 


Pro Leu Ser Tyr Arg 


Gly Ala Asp 


Val 


65 








70 


75 




80 


Phe 


He 


Leu 


Ala Phe 


Ser Leu 


He Ser Lys Ala Ser 


Tyr Glu Asn 


Val 








85 




90 


95 




Ser 


Lys 


Lys 


Trp He 


Pro Glu 


Leu Arg His Tyr Ala 


Pro Gly Val 


Pro 








100 




105 


110 




He 


He 


Leu 


Val Gly 


Thr Lys 


Leu Asp Leu Arg Glu 


Asp Lys Gin 


Phe 






115 






120 


125 




Phe 


Leu 


Asp 


His Pro 


Gly Ala 


Val Pro Leu Thr Thr 


Ala Gin Gly 


Glu 




130 




135 


140 






Glu 


Leu 


Arg 


Lys Ser 


He Gly 


Ala Ser Ala Tyr He 


Glu Cys Ser 


Ala 


145 








150 


155 




160 


Lys 


Thr 


Gin 


Gin Asn 


Val Lys 


Ala Val Phe Asp Ala 


Ala He Lys 


Val 






vl65 




170 


175 




Val 


Leu 


Gin 


Pro Pro 


Lys Gin 


Lys Lys Lys Lys Lys 


Arg Lys Gly 


Gin 








180 




185 


190 




Lys 


Ala 


Cys 


Ser He 


Leu 












195 













61 
640 
DNA 

Gossypium hirsutum 

<220> 

<221> CDS 

<222> (17) . . (613) 

<400> 61 

cacgcgtggc gtaagt atg agt get. tec agg ttc ata aaa tgc gtc acg gtc 52 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val 
1 5 10 



ggt 


gac 


ggt 


gec 


gtc 


ggc 


aag 


act 


tgt 


ctg 


etc 


att 


tec 


tac 


act 


age 


100 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Ser 








15 










20 










25 










aat 


act 


ttc 


cct 


ace 


gat 


tat 


gtg 


cca 


act 


gtc 


ttt 


gac 


aac 


ttt 


agt 


148 


Asn 


Thr 


Phe 


Pro 


Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 






30 










35 










40 












get 


aat 


gtg 


gtt 


gtg 


gat 


ggg 


aac 


act 


gtt 


aat 


ctg 


gga 


ttg 


tgg 


gat 


196 


Ala 


Asn 


Val 


Val 


Val 


Asp 


Gly 


Asn 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 




45 










50 








55 










60 




act 


get 


gga 


cag 


gaa 


gat 


tac 


aat 


aga 


tta 


aga 


cca 


ttg 


age 


tat 


cgt 


244 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 












65 










70 










75 






gga 


gca 


gat 


gtc 


ttc 


ttg 


ctg 


gca 


ttt 


tct 


etc 


att 


age 


aaa 


get 


age 


292 


Gly 


Ala 


Asp 


val 


Phe 


Leu 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Lys 


Ala 


Ser 










80 










85 










90 








tat 


gaa 


aat 


gtt 


get 


aag 


aaa 


tgg 


att 


cca 


gaa 


ttg 


aga 


cat 


tat 


gca 


340 


Tyr 


Glu 


Asn 


Val 


Ala 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Arg 


His 


Tyr 


Ala 





95 100 105 
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ccc ggt gtt cca att att ctt gtt ggg act aag ctt gat ctt cga gaa 3 88 

Pro Gly Val Pro lie He Leu Val Gly Thr Lys Leu Asp Leu Arg Glu 

110 H5 120 

gat aag cag ttc ttc ata gat cac cct ggt gcg gtg ccc att acc aca 436 
Asp Lys Gin Phe Phe He Asp His Pro Gly Ala Val Pro He Thr Thr 
125 130 135 140 

gca cag ggt gag gaa ttg aga aag eta att gga gcg cat ttt tac att 484 
Ala Gin Gly Glu Glu Leu Arg Lys Leu He Gly Ala His Phe Tyr He 

145 150 155 

gag tgt agt tea aaa aca caa cag aat gtg" aaa gcg gtc ttt gat gcg 532 
Glu Cys Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala 

160 165 170 

gec ate aaa gta gtt etc cag cct cca aag aag aag aag aaa aag aag 580 
Ala He Lys Val Val Leu Gin Pro Pro Lys Lys Lys Lys Lys Lys Lys 

175 180 185 

aga aag gca caa aaa get tgc tea ata ttg tgatcatgea aagaagtgat 630 
Arg Lys Ala Gin Lys Ala Cys Ser He Leu 

190 195 

attggtgcag 



640 



<210> 62 
<211> 198 
<212> PRT 

<213> Gossypium hirsutum 

<400> 62 m 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 " 25 30. 

Thr -Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys Gin Phe 

115- 120 125 

Phe He Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Leu He Gly Ala His Phe Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Lys Lys Lys Lys Lys Lys Arg Lys Ala Gin 

180 185 190 

Lys Ala Cys Ser He Leu 
195 

<210> 63 
<211> 1025 
<212> DNA 

<213> Nicotiana tabacum 

<220> 

<221> CDS 

<222> (103).. (696) 

<400> 63 

ttcttgtatc cgcccagctc ccccataccc atttcaatct ttttttttgt ttataatttt 



60 
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162 



210 



tttctgggaa tcaggtcttc tcaagaagat ctgagaagaa aa atg agt get teg 114 

Met Ser Ala Ser 
1 

agg ttt ata aaa tgt gtg acg gtg ggt gat ggt gec gtt gga aaa acg 
Arg Ptte He Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr 
5 10 15 20 

tgt etc ttg att tct tac acc age aac ace ttt cct acg gat tat gtg 
Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp Tyr Val 

25 30 35 

cct act gtg ttc gat aat ttc agt gca aat gtg gtc gtc aat ggg agt 258 
Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val Val Asn Gly Ser 

40 45 50 

act gtc aac eta ggg ttg tgg gat aca get gga cag gag gat tac aat 306 
Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn 

55 60 65 

agg tta aga cct ctg agt tac cgt gga get gat gtt ttc ate ttg gca 354 
Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val Phe lie Leu Ala 

70 75 80 

ttc tct etc att agt aaa gee age tat gaa aat gtt tec aag aag tgg 402 
Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn Val Ser Lys Lys Trp 
85 90 95 100 

att cct gag ttg aag cat tat get ccc ggt gtt cca ata gtg ctt gtt 450 
He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro He Val Leu Val 

105 HO 115 

gga aca aaa ctt gat ctt egg gat gat aag cag ttc ttc ata gac cat 498 
Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe Phe He Asp His 

120 125 130 

cct ggt get gtg cca att act act get cag ggc gag gag ctg agg aaa 546 
Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu Glu Leu Arg Lys 
135 140 145 

■. acg att ggg gee cct get tac att gaa tgt agt tea aaa aca caa cag 594 
Thr He Gly Ala Pro Ala Tyr lie Glu Cys Ser Ser Lys Thr Gin Gin 

150 155 160 

aat gtg aaa gca gtc ttt gat get get att aag gtc gtg etc cag cca 642 
Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val Val Leu Gin Pro 
165 " 170 175 180 

ccc aag cag aaa aaa aag aag ggg aag gee caa aag gca tgt teg att 690 
Pro Lys Gin Lys Lys Lys Lys Gly Lys Ala Gin Lys Ala Cys Ser He 

185 190 195 

ttg tgatcaacag taggtccaac aatattactg gtcacagttc tccttgtaat 743 
Leu 

ctctccctca tattcctttg tttttccttt gtcteggaga aagtaggagg tccatgtatg 803 
tctctttaaa attacttgtg atctcttgaa ttgcaggtgg ctcggttttt atactgetgt 863 
catctctagt tgttgagtca cagcaccttg ttgtaggcca ctgattgggc cctctctggt 923 
ttcatttget ttatgattaa tgactgaaag atccagttgg aaaaaaaaaa aaaaaaaaaa 983 
aaaaaactcg gggggggccg ggcccaatcc cctatagggg tg 1025 



<210> 64 
<211> 197 
<212> PRT 

<213> Nicotiana tabacum 



<400> 64 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 10 15 
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Val Gly Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 

20 " 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asn Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe lie Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 " 120 125 

Phe He Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Thr He Gly Ala Pro Ala Tyr He Glu Cys Ser Ser 
145 • -• - 150 155 - 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Gly Lys Ala Gin Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210>- 65 
<211> 1178 
<212> DNA 

<213> Medicago truncatula 

<220> 

<221> CDS 

<222> (136) . . (729) 



<400> 65 

cctatttcat cttcatcacc gattattctt gaacagttac caatttcaaa ctcaaacaca 



60 



caacaacatt gacacatgaa taattagggt ttatacctca aagaaacaca acaaacttga 12 0 

ataattaggt tcgaa atg agt ggt tec agg ttc ate aag tgt gtc aca gtt 171 
Met Ser Gly Ser Arg Phe He Lys Cys Val Thr Val 
1 5 10 

ggt gat ggt gec gtt gga aag act tgt ttg ctt ate tec tac acc age 219 
Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser 

15 20 25 

aac act ttc cct acg gac tat gtg ccg act gtc ttt gac aat ttc agt 267 
Asn Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser 

30 35 40 

gca aat gta gtt gtg gat ggg age act at a aat etc ggg ttg tgg gat 315 
Ala Asn Val Val Val Asp Gly Ser Thr He Asn Leu Gly Leu Trp Asp 
45 50 55 60 

act get ggc caa gaa gat tac aat aga tta aga ccc tta age tat cgt 363 
Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg 

65 70 75 

gga gca gat gtt ttc ctg ctt get ttt tct etc ata age aag get age 411 
Gly Ala Asp Val Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser 

80 85 90 

tat gaa aat att gee aaa aaa tgg att cct gag ttg agg cat tat get 459 
Tyr Glu Asn He Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala 

95 100 105 

cct ggt gtt cca att att etc gtt gga aca aaa ctt gat ctt egg gat 507 
Pro Gly Val Pro He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp 

110 115 120 

gat age cag ttc ttt caa gac cat cct ggt gcg gcg cca ate acc aca 555 
Asp Ser Gin Phe Phe Gin Asp His Pro Gly Ala Ala Pro He Thr Thr 
125 130 135 140 



WO 2005/014828 PCT/EP2004/008136 

59/291 

gca cag ggt gag gaa ctg aga aaa ctt ate ggt get cca gtt tac att 603 
Ala Gin Gly Glu Glu Leu Arg Lys Leu lie Gly Ala Pro Val Tyr lie 

145 150 155 

gaa tgt agt .tcc aaa aca cag aag aat gtg aag get gtt ttt gat teg 651 
Glu Cys Ser Ser Lys Thr Gin Lys Asn Val Lys Ala Val Phe Asp Ser 

160 165 170 

gee ate aaa gta gtt eta cag cca cca aag caa aag aaa aca aag aga 699 
Ala lie Lys Val Val Leu Gin Pro Pro Lys Gin Lys Lys Thr Lys Arg 

175 180 185 

aag ggg caa aaa gee tgt tec att ttg tgatcttcag ttctttcgta 746 
Lys Gly Gin Lys Ala Cys Ser lie Leu 

190 ~ 195 

ttgtagtttg tggaggatga catgacagta gccattttcc tgggtaatcc tttgecatte 806 

tgattctata tatatcctta tactattttc gacctagatg gaggaacege catgtccaac 866 

tgaccgcgtt aaagcttget tataaaaaat caagctattt gtctctggct gaaggataat 926 

atgtgaagtc ataattccat catgtttggt gtgtacaatt gttgtgtttc teeggaatgt 986 

ttaccttttg tattgattat caaatggtca tctccatttg tggacatggg ttttactcac 1046 

attgtagtct aatcttagga agtgcttgca acatatgtgc tttggatgtg aactaagtat 1106 

atgtaattaa ttatgttagc tacttaacat ccttaatata gctcggtttg ttcaaaaaaa 1166 

aa 1178 



<210> 66 
<211> 197 
<212> PRT 

<213> Medicago truncatula 
<400> 66 

Met Ser Gly Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asp Gly Ser Thr lie Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe Leu Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn lie 

85 90 95 

Ala Lys Lys Trp lie Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 

100 105 110 

lie lie Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Ser Gin Phe 

115 120 125 

Phe Gin Asp His Pro Gly Ala Ala Pro lie Thr Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Leu lie Gly Ala Pro Val Tyr lie Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Lys Asn Val Lys Ala Val' Phe Asp Ser Ala lie Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Thr Lys Arg Lys Gly Gin Lys 
180 185 190 



224 



272 



320 
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Ala Cys Ser He Leu 
195 

<210> 67 
<211> 1139 
<212> DNA 

<213> Medicago truncatula 

<220> 

<221> CDS 

<222> (174).. (767) 

<400> 67 , 

acgaacgaac gcttcaacgg caaaatttcc cgggttcctt gtctctcttt catgcgcgag 60 

aattctcatt ttctcacgct cctaatggat gcgtaattga agcgaataat tggaggagtg 120 

tgtgagtgat ttggagtaat aaggtgaaag atctggatag ggtttgaaag aag atg 176 

Met 
1 

age get tct agg ttc ate aag tgt gtt act gtt ggg gat gga get gtt 
Ser Ala Ser Arg Phe He Lys Cys val Thr Val Gly Asp Gly Ala Val 

5 10 15 

ggt aaa act tgt ttg tta att tea tac ace age aat acc ttc ccc act 
Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro Thr 

20 25 30 

gac tat gtg cca act gtc ttc gac aat ttc agt gca aat gtg gtt gtg 
Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val Val 

35 40 45 

aat gga age act gtg aat ctg ggt ttg tgg gac act gca gga caa gag 368 
Asn Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu 
50 - 55 60 65 

gat tat aac aga tta aga cct ttg agt tat cgt ggt gee gat gtt ttc 416 
Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val Phe 

70 75 80 

att etc get ttc tec etc ata age aag gec agt tat gaa aat gtt tec 464 
. He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val Ser 
85 90 95 

aaa aag tgg att cca gag ttg aag cat tat gca cct ggt gtt ccc att 512 
Lvs Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro He 

100 105 HO 

att ctg gtt ggc aca aag ctt gac ctt egg gat gac aag cag ttc ttc 560 
He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe Phe 

115 " 120 125 

ate gac cat cct ggt get gtt cct att acc act get cag gga gaa gag 
Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu Glu 
130 135 140 145 

ctt agg aag ctg ate aat gca cct get tat ate gaa tgc agt teg aaa 
Leu Arg Lys Leu He Asn Ala Pro Ala Tyr He Glu Cys Ser Ser Lys 

150 155 160 

tea cag cag aat gtg aaa gca gtc ttt gat gca gee ata aga gtt gtc 
Ser Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Arg Val Val 

165 170 175 

ctt caa cca cca aag caa aag aaa aag aag agt aaa gca cca aag get 
Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Pro Lys Ala 

180 ~ 185 190 

tgt teg ata ttg tgatttgeta ggteggtcta aagaaatttg acaaccattc 804 
Cys Ser He Leu 
195 

tccaacatgt atttctctcc tcttcttcct attcttcatt ectattatge tgaagaagtt 8 64 

ggagtctatg tgggtgttct ttegtctaac agactctaga tctcgttgaa gcataaggtg 924 



608 



656 



704 



752 



gttggtgctt tgctttcttg gaattattgt gacttctaat tgtcaaattt gtatttcatg 
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Ala Cys Ser He Leu 
195 

<21p> 67 
<211> 1139 
<212> DNA 

<213> Medicago truncatula 

<220> 

<221> CDS 

<222> (174) . . (767) 

<400> 67 

acgaacgaac gcttcaacgg caaaatttcc cgggttcctt gtctctcttt catgcgcgag 60 



aattctcatt ttctcacgct cctaatggat gcgtaattga agcgaataat tggaggagtg 120 



tgtgagtgat ttggagtaat aaggtgaaag atctggatag ggtttgaaag aag atg 176 

Met 
1 



age 


get 


tct 


agg 


ttc 


ate 


aag 


rgi: 




ttCt 


gux 


ggg 


y cx l. 






crtt 

y i» i— 


224 


Ser 


Ala 


Ser 


Arg 
5 


pne 


xie 


Lys 


Cys 


vax 
i n 

1U 


Thr 


val 




Asp 


uiy 

JLD 


XU.CL 


Val 

V CLJ. 




ggt 


aaa 


act 


tgt 


ttg 




an c 


tea 


LaC 


acc 


age 


aat 






v — • ^— . V— 


act 


272 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


xie 


C A V 

ser 


Tyr 


Thr 


oer 


Asn 


inr 


rue 


irlU 


XXIX. 




20 










25 




















gac 


tat 


gtg 


cca 


act 


guc 


tec 


gac 


aaL 


LLC 




y v— d 


-a -a f 


y ug 


erf- f 

y&t 


erf cr 

y u y 


320 


Asp 


Tyr 


Val 


Pro 


Thr 


vai 


Jtrne 


Asp 


Asn. 


XT lie 


Ser 


Ma 
AlCL 




V CLJL 


Val 

V CLl 


Val 

. V CL J. 




35 
































aat 


gga 


age 


act 


gtg 


aat 


ctg 


ggt 


ttg 


tgg 


gac 


act 


gca 


gga 


caa 


gag 


368 


Asn 


Gly 


Ser 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 




50 








55 










60 










65 




gat 


tat 


aac 


aga 


tta 


aga 


cct 


ttg 


agt 


tat 


cgt 


ggt 


gee 


gat 


gtt 


ttc 


416 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 






70 










75 










80 






att 


etc 


get 


ttc 


tec 


etc 


ata 


age 


aag 


gee 


agt 


tat 


gaa 


aat 


gtt 


tec 


464 


lie 


Leu 


Ala 


Phe 
85 


Ser 


Leu 


He 


Ser 


Lys 
90 


Ala 


Ser 


Tyr 


Glu 


Asn 
95 


val 


Ser 




aaa 


aag 


tgg 


att 


cca 


gag 


ttg 


aag 


cat 


tat 


gca 


cct 


ggt 


gtt 


ccc 


att 


512 


Lys 


Lys 


Trp 
100 


He 


Pro 


Glu 


Leu 


Lys 
105 


His 


Tyr 


Ala 


Pro 


Gly 
110 


Val 


Pro 


He 




att 


ctg 


gtt 


ggc 


aca 


aag 


ctt 


gac 


ctt 


egg 


gat 


gac 


aag 


cag 


ttc 


ttc 


560 


He 


Leu 
115 


val 


Gly 


Thr 


Lys 


Leu 
120 


Asp 


Leu 


Arg 


Asp 


Asp 
125 


Lys 


Gin 


Phe 


Phe 




gtc 


gac 


cat 


cct 


ggt 


get 


gtt 


cct 


att 


acc 


act 


get 


cag 


gga 


gaa 


gag 


608 


Val 


Asp 


His 


Pro 


Gly 


Ala 


Val 


Pro 


He 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 




130 






135 










140 










145 




ctt 


agg 


aag 


ctg 


ate 


aat 


gca 


cct 


get 


tat 


ate 


gaa 


tgc 


agt 


teg 


aaa 


656 


Leu 


Arg 


Lys 


Leu 


He 


Asn 


Ala 


Pro 


Ala 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 


Lys 








150 










155 










160 






tea 


cag 


cag 


aat 


gtg 


aaa 


gca 


gtc 


ttt 


gat 


gca 


gee 


ata 


aga 


gtt 


gtc 


704 


Ser 


Gin 


Gin 


Asn 


Val 


Lys 


Ala 


Val 


Phe 


Asp 


Ala 


Ala 


He 


Arg 


val 


Val 










165 








170 










175 








ctt 


caa 


cca 


cca 


aag 


caa 


aag 


aaa 


aag 


aag 


agt 


aaa 


gca 


cca 


aag 


get 


752 


Leu 


Gin 


Pro 


Pro 


Lys 


Gin 


Lys 


Lys 


Lys 


Lys 


Ser 


Lys 


Ala 


Pro 


Lys 


Ala 








180 








185 










190 











tgt teg ata ttg tgatttgeta ggteggtcta aagaaatttg acaaccattc 804 
Cys Ser He Leu 



195 

tccaacatgt atttctctcc tcttcttcct attcttcatt ectattatge tgaagaagtt 864 



ggagtctatg tgggtgttct ttegtctaac agactctaga tctcgttgaa gcataaggtg 



gttggtgctt tgctttcttg gaattattgt gacttctaat tgtcaaattt gtatttcatg 984 
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<400> 69 

gttcaacgct aaaatcacac tctcatcttc caaatagatg tgtgatagta attgaaagag 



1139' 



gttcccacct ttttcttaac ataagtttta tgtattaata gactggaata tcccagttag 1044 
gattactacc ttgtttgctt gaagatatat ataatatcta cattattttg catttagtta 1104 
taatgcttgc tctattcaaa aaaaaaaaaa aaaaa 



<210> 68 
<211> 197 
<212> PRT 

<213> Medicago truncatula 
<400> 68 

Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

15 10 15 

Val Gly Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asn Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val .Pro 

100 105 HO 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

130 * 135 140 

Glu Leu Arg Lys Leu He Asn Ala Pro Ala Tyr He Glu Cys Ser Ser 
145 ~ " 150 155 160 

Lys Ser Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Arg Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Pro Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210> 69 
<211> 985 
<212> DNA 

<213> Medicago truncatula 

<220> 
<221> CDS 

<222> (143) . . (736) 



60 



aagaagaaga agaaagtact ggaggagggg gactgtcctc aggtgaagat ctgaggctat 120 



tttgggtatt tcatttttca ag atg agt get tct agg ttt att aaa tgt gtt 172 

Met Ser Ala Ser Arg Phe He Lys Cys Val 
1 5 10 

act gtt ggt gat gga get gtt ggc aaa act tgt ttg ttg att tct tac 220 
Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr 
15 20 25 
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acc 


age 


aat 


act 


ttc 


ccc 


acg 


gat 


tat 


gtg 


ccg 


aca 


gtt 


ttt 


gac 


aat 


268 


Thr 


Ser 


Asn 


Thr 


Phe 


Pro 


Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 










30 










35 










40 






ttc 


agt 


gcg 


aat 


gtg 


gtt 


gtt 


aat 


gga 


agt 


att 


gtg 


aat 


ctg 


ggt 


ttg 


316 


Phe 


Ser 


Ala 


Asn 


Val 


Val 


Val 


Asn 


Gly 


Ser 


He 


Val 


Asn 


Leu 


Gly 


Leu 








45 










50 










55 








tgg 


gat 


act 


get 


gga 


caa 


gag 


gat 


tat 


aac 


aga 


tta 


aga 


cct 


ttg 


agt 


364 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 


Arg Leu 


Arg 


Pro 


Leu 


Ser 






60 










65 










70 










tac 


cgt 


ggt 


gcc 


gat 


gtt 


ttc 


ata 


ttg 


get 


ttc 


tct 


etc 


ata 


age 


aaa 


412 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


He 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Lys 




75 










80 










85 










90 




gcc 


agt 


tat 


gaa 


aat 


gtc 


tec 


aaa 


aag 


tgg 


att 


cca 


gag 


ttg 


aag 


cat 


460 


Ala 


Ser 


Tyr 


Glu 


Asn 
95 


Val 


Ser 


Lys 


Lys 


Trp 
100 


He 


Pro 


Glu 


Leu 


Lys 
105 


His 




tat 


gca 


cct 


ggt 


gtc 


ccc 


ata 


att 


ctg 


gtt 


gga 


aca 


aag 


ctt 


gat 


ctt 


508 


Tyr 


Ala 


Pro 


Gly 


Val 


Pro 


lie 


lie 


Leu 


Val 


Gly Thr 


Lys 


Leu 


Asp 


Leu 










110 










115 










12 0 






egg 


gat 


gat 


aag 


cag 


ttc 


tgc 


ata 


gac 


cat 


cct 


ggt 


gcc 


gtt 


ccc 


att 


556 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 


Cys 


He 


Asp 


His 


Pro Gly 


Ala 


val 


Pro 


He 








125 










13 0 










135 










acc 


aca 


get 


cag 


gga 


gaa 


gag. 


ctg 


agg 


aag 


ctg 


att 


aat 


gca 


cca 


get 


604 


Thx 


Thr 
140 


Ala 


Gin 


Gly 


Glu 


Glu 
145 


Leu 


Arg 


Lys 


Leu 


He 
150 


Asn 


Ala 


Pro 


Ala 




tac 


att 


gaa 


tgc 


agt 


tea 


aaa 


tea 


cag 


gag 


aac 


gtg 


aag 


gcg 


gtg 


ttt 


652 


Tyr 


lie 


Glu 


Cys 


Ser 


Ser 


Lys 


Ser 


Gin 


Glu 


Asn 


Val 


Lys 


Ala 


Val 


Phe 




155 










160 










165 










170 




gat 


gca 


gcc 


ata 


aga 


gtt 


gtc 


ctt 


caa 


cca 


cca 


aag 


cag 


aag 


aaa 


aag 


700 


Asp 


Ala 


Ala 


He 


Arg 
175 


Val 


Val 


Leu 


Gin 


Pro 
180 


Pro 


Lys 


Gin 


Lys 


Lys 
185 


Lys 




aag 


aat 


aaa 


gca 


caa 


aag 


gcc 


tgt 


tea 


ata 


ttg taattcacta ggtgtaaaca 


753 


Lys- 


Asn 


Lys 


Ala 
190 


Gin 


Lys 


Ala 


Cys 


Ser 
195 


He 


Leu 














tttgattgtc ctttgttcta cctgtattcc cctccccttc 


ttttcatcat tttctttgtt 


813 



attgaagaag ttggagagtc tttgtgggtg ttctttcaga caaatactct agatctcctt 873 



gaagcataag gtggcttaat gtttttgett tctatgaatt attattgtca actctaatag 933 



ttaaatttgc atttcatget tcccatcttt ttaaaaaaaa aaaaaaaaaa aa 985 



<210> 70 
<211> 197 
<212> PRT 

<213> Medicago truncatula 



<400> 70 



Met 


Ser 


Ala 


Ser 


Arg 


Phe 


He 


Lys 


Cys 


Val 


Thr Val 


Gly 


Asp 


Gly Ala 


1 








5 










10 








15 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr Ser 


Asn 


Thr 


Phe Pro 








20 










25 








30 




Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe Ser 


Ala 


Asn 


Val Val 






35 










40 








45 






Val 


Asn 


Gly 


Ser 


He 


Val 


Asn 


Leu 


Gly 


Leu 


Trp Asp 


Thr 


Ala 


Gly Gin 




50 










55 








60 








Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr Arg 


Gly 


Ala 


Asp Val 


65 










70 










75 






80 


Phe 


He 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Lys 


Ala Ser 


Tyr 


Glu 


Asn Val 










85 










90 








95 


Ser 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Lys 


His 


Tyr Ala 


Pro 


Gly 


Val Pro 








100 










105 








110 




He 


He 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg Asp 


Asp 


Lys 


Gin Phe 
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115 120 125 

Cys He Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Leu He Asn Ala Pro Ala Tyr He Glu Cys Ser Ser 
145 " 150 155 I 60 

Lys Ser Gin Glu Asn Val Lys Ala Val Phe Asp Ala Ala He Arg Val 

165 170 I 75 

val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Asn Lys Ala Gin Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210> 71 
<211> 779 
<212> DNA 

<213> Brassica napus 



<220> 

<221> CDS 

<222> (183) . . (776) 



ggctcatagg gtttctagtt ttcgggagaa agttctttca tttttcaaga atgcccagat 60 
gctcgattga ggttgagcat ccagacaacg aagaagaaaa ttaacaaaat ctttgagctt 
ttctttttat ttttggtttt tcccccctga aagtattaac aaataatctt cgagtagaga 180 



120 



227 
275 
323 



aa atg age gca tct egg ttc ata aag tgt gta acg gtt ggt gat gga 
Met Ser Ala Ser Arg Phe He Lys Cys Val- Thr Val Gly Asp Gly 
1 5 10 15 

get gtc gga aaa aca tgt ttg ttg att tct tac aca age aac act ttc 
Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe 

20 25 30 

cct acg gac tat gtg ccc acc gtt ttc gat aat ttc agt get aat gtg 
Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val 

35 40 45 

gtg gtt aac gga gee acc gtt aat' ctt gga ttg tgg gat act gca ggg 371 
Val Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 

50 55 60 

caa gag gat tac aac aga tta aga cca eta age tac cgt gga get gat 419 
Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp 

65 70 75 

att ttc ata ttg gec ttc tct ctt ate agt aaa gee agt tat gaa aac 467 
Val Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn 
80 8 5 90 

gtc tec aaa aag tgg ate ccg gag ttg aag cat tac gcg cct ggt gtc 515 
Val Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val 

100 105 HO 

ccc gtc ate ctt gtt gga tea aag ctt gat ctt cga gat gat aag caa 563 
Pro val He Leu Val Gly Ser Lys Leu Asp Leu Arg Asp Asp Lys Gin 

115 120 125 

ttc ttc gtc gac cat cct ggc get gtc ccg att aca act get cag gga 611 
Phe Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly 

130 ~ I 35 140 

gaq gag ctg agg aag eta ata gat gca cct act tac ate gaa tgc agt 659 
llu Glu Leu Arg Lys Leu He Asp Ala Pro Thr Tyr He Glu Cys Ser 

145 15° 155 

tec aaa tct caa gag aat gtg aaa get gtc ttt gac gca gee ata cga 707 
Ser Lys Ser Gin Glu Asn Val Lys Ala Val Phe Asp Ala Ala He Arg 
160 I 65 170 

gtg gtg ttg caa ccg cct aag cag aag aag aaa aag age aaa gcg cag 755 
val Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Gin 

180 185 190 

aag gca tgc tec ate cag tgatat 



779 



35 w 
Val Asn Gly- Ala Thr. Val Asn. Leu Gly Leu Trp Asp Thr Ala Gly Gin 

55 60 



Val He Leu Val Gly Ser Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

_ - _ ion. 12 5 
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Lys Ala Cys Ser He Gin 
195 

<210> 72 
<211> .197 
<212> PRT 

<213> Brassica napus 

Met°Ser 2 Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 
35 40 45 

l 

Glu A^p Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 

Asp Leu Arg Asp Asp 
115 • 120 125 

Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

130 140 
Glu Leu Arg Lys Leu He Asp Ala Pro Thr Tyr He Glu Cys Ser Ser 
145 " 150 155 ibU 

Lys Ser Gin Glu Asn Val Lys Ala Val Phe Asp Ala Ala He Arg Val 

165 I 70 1 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Gin Lys 

180 I 85 190 

Ala Cys Ser He Gin 
195 

<210> 73 
<211> 757 
<212> DNA 

<213> Brassica napus 

<220> 

<221> CDS 

<222> (164).. (757) 

cga£acaccg attcatttca ttcttcattg atcgcttcct ctcacatacc cgatttcgtt 60 
tagggttaac attttctagg gttttagaga taggcgaaac tgaaattgag aagaagacgg 
gtgttgtttc ctcatagatc tgaggggttg aatttttcga ttt atg age gca tct 



egg ttc ata aag tgc gtg acg gtt ggg gac gga gca gtg ggc aaa aca 
Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr 
5 10 15 20 

tgt etc etc atctct tac ace age aac act ttc cct acg gat tat gtt 
Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp Tyr Val 
■ - * -25 30 




cca act gtt ttc gat aac ttc age get aat gtt gtt gtt aac gga gec 
Pro Thr Val Phe Lp Asn Phe Ser Ala Asn Val Val Val Asn Gly Ala 
40 ^ ' 45 50 

qtc aac tta gga etc tgg gat ace gca ggg cag gag gat tat aac 
Val Asn L-eu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn 



120 



175 



223 
271 
319. 
367 
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aga ttg aga ccc ttg agt tac cgc ggt get gac gtt ttc ate ttg gee 415 
Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val Phe He Leu Ala 

70 75 80 

ttc tct etc ate agt aag get agt tat gag aat gtc tec aag aag tgg 463 
Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val Ser Lys Lys Trp 
85 90 95 10° 

ate cct gag ctg acc cac tat gee cct ggt gtc cct ate gtt ctt gtt 511 
• He Pro Glu Leu Thr His Tyr Ala Pro Gly Val Pro He Val Leu Val 
105 HO H5 

ggt acc aaa eta gat ctt agg gat gac aaa cag ttc ttc gtt gac cac 559 
Glv Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe Phe Val Asp Hxs 

120 ^ 125 130 

cct ggt get gta cct att acc act tct cag gga gag gaa eta atg aag 607 
Pro Gly Ala Val Pro He Thr Thr Ser Gin Gly Glu Glu Leu Met Lys 

135 140 145 

eta att gga get cct teg tac ate gag tgc agt tea aaa tct caa gag 655 
Leu He Gly Ala Pro Ser Tyr lie Glu Cys Ser Ser Lys Ser Gin Glu 

150 ' 155 160 

aac gtg aaa ggg gtg ttt gat gca gcg ate aga gtg gta ctt caa cct 703 
Asn Val Lys Gly Val Phe Asp Ala Ala He Arg Val Val Leu Gin Pro 
165 170 175 180 

cca aag cag aag aaa aag aag age aag gca caa aag gee tgc tec att 751 
Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Gin Lys Ala Cys Ser He 
185 190 195 



ttg taa 
Leu 



757 



<210> 74 
<211> 197 
<212> PRT 

<213> Brassica napus 

Met°Ser 4 Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

15 io 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg- Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 * * 70 75 80 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Thr His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ser Gin Gly Glu 

130 ^ 135 140 

Glu Leu Met Lys Leu He Gly Ala Pro Ser Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lvs Ser Gin Glu Asn Val Lys Gly Val Phe Asp Ala Ala He Arg Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Gin Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210> 75 

<211> 645 

<212> DNA 

<213> Oryza sativa 



<220> 
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<221> CDS 

<222> (1) . . (645) 

<400> 75 

atg age teg gcg gcg gcg gcg acg agg ttc ate aag tgc gtc acc gtg 48 

Met Ser Ser Ala Ala Ala Ala Tlir Arg. Phe lie Lys Cys Val Thr Val 

1 5 10 15 

ggg gac ggc gcg gtg ggg aag acg tgc atg etc ate tgc tae acc tgc 96 

Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Cys 

'20 25 30 

aac aag ttc ccc acc gat tae ate ccc acc gtg ttc gac aac ttc age 144 
Asn Lys Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser 

35 40 45 

gee aat gtc tec gtg gac ggg age gtc gtc aac etc ggc etc tgg gac 192 
Ala Asn Val Ser Val Asp Gly Ser Val Val Asn Leu Gly Leu Trp Asp 

50 55 60 

act gca ggt cag gag gat tac age agg ttg agg cct ctg age tac agg 240 
Thr -Ala Gly Gin Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg 
65 70 ^ 75 80 

gga gee gat gtg ttc ate ctg tec ttc tec ctg ata age agg gcg age 288 
Gly Ala Asp Val Phe He Leu Ser Phe Ser . Leu lie Ser Arg Ala Ser 

85 90 95 

tat gag aat gtt cag aag aag tgg atg cea gag ctt cgc egg ttt gcg 336 
Tyr Glu Asn Val Gin Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala 

100 105 110 

cct ggt gtt cct gta gtt ctt gtt gga acc aag ttg gat etc cgt gaa 384 
Pro Gly Val Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu 

115 120 125 

gat agg gee tat ctt get gat cat cea get tct tec ata ata aca acg 432 
Asp Arg Ala Tyr Leu Ala Asp His Pro Ala Ser Ser He He Thr Thr 

130 135 140 : 

gag cag gga gaa gaa ctg agg aag eta ata gga gcg gtc gec tac ate ■ 4 80 

Glu Gin Gly Glu Glu Leu Arg Lys Leu He Gly Ala Val Ala Tyr lie • 
145 150 155 160 

gaa tgc age tec aag aca cag aga aac att aaa get gtt ttc gac act 528 
Glu Cys Ser Ser Lys Thr Gin Arg Asn He Lys Ala Val Phe Asp Thr 

165 170 175 

gee ate aaa gtg gtg ctt caa cct cea aga cat aag gat gta acc aga 576 
Ala He Lys Val Val Leu Gin Pro Pro Arg His Lys Asp Val Thr Arg 

180 185 *" " 190 

aag aaa etc caa tea age tec aat egg cea gta agg egg tac ttt tgc 624 
Lys Lys Leu Gin Ser Ser Ser Asn Arg Pro Val Arg Arg Tyr Phe Cys 

195 200 205 

gga age get tgt ttc gcg tag 645 
Gly Ser Ala Cys Phe Ala 
210 

<210> 76 
<211> 214 
<212> PRT 

<213> Oryza sativa 
<400> 76 

Met Ser Ser Ala Ala Ala Ala Thr Arg Phe He Lys Cys Val Thr Val 

1 5 10 " * 15 

Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Cys 

20 25 30 

Asn Lys Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser 

35 40 45 

Ala Asn Val Ser Val Asp Gly Ser Val Val Asn Leu Gly Leu Trp Asp 

50 55 60 

Thr Ala Gly Gin Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg 
65 70 75 80 

Gly Ala Asp Val Phe He Leu Ser Phe Ser Leu He Ser Arg Ala Ser 

85 90 95 

Tyr Glu Asn Val Gin Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala 
100 105 110 
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Pro Gly Val Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu 

115 120 125 

Asp Arg Ala Tyr Leu Ala Asp His Pro Ala Ser Ser lie He Thr Thr 

130 135 140 

Glu Gin Gly Glu Glu Leu Arg Lys Leu He Gly Ala Val Ala Tyr He 
145 150 155 160 

Glu Cys Ser Ser Lys Thr Gin Arg Asn He Lys .Ala Val Phe Asp Thr 

165 170 175 

Ala He Lys Val Val Leu Gin Pro Pro Arg His Lys Asp Val Thr Arg 

180 185 190 

Lys Lys Leu Gin Ser Ser Ser Asn Arg Pro Val Arg Arg Tyr Phe Cys 

195 200 205 

Gly Ser Ala Cys Phe Ala 
210 

<210> 77 

<211> 1135 . . . 

<212> DNA 

<213> Homo sapiens (man) 

<220> 

<221> CDS 

<222> (105) . . (680) 

<400> 77 _ n 
ggcagccgag gagaccccgc gcagtgctgc caacgccccg gtggagaagc tgaggtcatc 60 

atcagatttg aaatatttaa agtggataca aaactatttc agca atg cag aca att 116 
3 Met Gin Thr He 

1 

* aag tgt gtt gtt gtg ggc gat ggt get gtt ggt aaa aca tgt etc ctg 164 
Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu 
5 10 15 20 

ata tec tac aca aca aac aaa ttt cca teg gaa tat gta ccg act gtt 
He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr Val Pro Thr Val 

25 " 30 35 

ttt gac aac tat gca gtc aca gtt atg att ggt gga gaa cca tat act 
Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Gly Glu Pro Tyr Thr 

40 45 50 

ctt gga ctt ttt gat act gca ggg caa gag gat tat gac aga tta cga 
Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg 

55 60 65 

ccg ctg agt tat cca caa aca gat gta ttt eta gtc tgt ttt tea gtg 
Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val Cys Phe Ser Val 

70 ~ 75 80 

gtc tct cca tct tea ttt gaa aac gtg aaa gaa aag tgg gtg cct gag 404 
Val Ser Pro Ser Ser Phe Glu Asn Val Lys Glu Lys Trp Val Pro Glu 
85 90 95 100 

ata act cac cac tgt cca a<£g act cct ttc ttg ctt gtt ggg act caa 
lie Thr His His Cys Pro Lys Thr Pro Phe Leu Leu Val Gly Thr Gin 

105 HO H5 

att gat etc aga gat gac ccc tct act att gag aaa ctt gec aag aac 
He Asp Leu Arg Asp Asp Pro Ser Thr He Glu Lys Leu Ala Lys Asn 

120 125 130 

aaa cag aag cct ate act cca gag act get gaa aag ctg gec cgt gac 548 
Lys Gin Lys Pro He Thr Pro Glu Thr Ala Glu Lys Leu Ala Arg Asp 

135 140 145 

ctg aag get gtc aag tat gtg gag tgt tct gca ctt aca cag aga ggt 596 
Leu Lys Ala Val Lys Tyr Val Glu Cys Ser Ala Leu Thr Gin Arg Gly 

150 155 160 

ctg aag aat gtg ttt gat gag get ate eta get gec etc gag cct ccg 644 
Leu Lys Asn Val Phe Asp Glu Ala He Leu Ala Ala Leu Glu Pro Pro 
165 170 175 180 

gaa act caa ccc aaa agg aag tgc tgt ata ttc taaactgttt tctccttccc 697 
Glu Thr Gin Pro Lys Arg Lys Cys Cys He Phe 

185 190 
ttctttgctg ctgcttcctg tcccactact gtagaaagat cgtttaaaaa caaaggaata 757 



212 



260 



308 



356 



452 



500 
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aaaccatcct gtttgaaagc ctctgcgtct 
attagttttt gatcaagaat tgcaatatca 
ggacttgttt taacgttctg ctgcttgagt 
ttactattgc aaagggaact tggtctggct 
ttttaacact tctagatctt agttcagatg 
tatgatcaat tgttaattgt " aattgcatga 
gagtgtaatg gggaaggg 



ttttactcac caccttagag caacctctgt 817 

tataaatttt ttgtgatcag tagtcaagtt 877 

tgcctgatgc tcagagcttt ttggtttgga 937 

tagatgtcct cttggagaaa ataacaagag 997 

gagaaagtaa cacaaacatc attttactct 1057 

caaaccttat ggaaaagggg tgacctagta 1117 

1135 



<210> 78 
<211> 191 
<212> PRT 

<213> Homo sapiens (man) 
<400> 78 



Met 


Gin 


Thr 


He 


Lys 


Cys 


Val 


Val 


1 








5 








Thr 


Cys 


Leu 


Leu 


lie 


Ser 


Tyr 


Thr 








20 










Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Tyr 






35 










40 


Glu 


Pro 


Tyr 


Thr 


Leu 


Gly 


Leu 


Phe 




50 










55 




Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


65 










70 






Cys 


Phe 


Ser 


Val 


Val 


Ser 


Pro 


Ser 










85 








Trp 


Val 


Pro 


Glu 


He 


Thr 


His 


His 








100 










Val 


Gly 


Thr 


Gin 


He 


Asp 


Leu 


Arg 






115 










120 


Leu 


Ala 


Lys 


Asn 


Lys 


Gin 


Lys 


Pro 




130 










135 




Leu 


Ala 


Arg 


Asp 


Leu 


Lys 


Ala 


Val 


145 










150 






Thr 


Gin 


Arg 


Gly 


Leu 


Lys 


Asn 


Val 










165 








Leu 


Glu 


Pro 


Pro 


Glu 


Thr 


Gin 


Pro 



■ 180 

<210> 79 
<211> 576 
<212> DNA 

<213> .Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1) . . (576) 

<400> 79 

atg caa gcg ate aaa tgt gtc gtc 
Met Gin Ala He Lys Cys Val Val 



Val 


Gly 


Asp Gly 


Ala 


Val 








10 










15 




Thr 


Asn 


Lys 


Phe 


Pro 


Ser 


Glu 


Tyr 


25 










30 






Ala 


Val 


Thr 


Val 


Met 


He 


Gly 


Gly 










45 








Asp 


Thr 


Ala Gly 


Gin 


Glu 


Asp 


Tyr 








60 










Pro 


Gin 


Thr Asp 


Val 


Phe 


Leu 


Val 






75 










80 


Ser 


Phe 


Glu 


Asn 


Val 


Lys 


Glu 


Lys 




90 










95 




Cys 


Pro 


Lys 


Thr 


Pro 


Phe 


Leu 


Leu 


105 










110 






Asp 


Asp 


Pro 


Ser 


Thr 


He 


Glu 


Lys 










125 








He 


Thr 


Pro 


Glu 


Thr 


Ala 


Glu 


Lys 








140 










Lys 


Tyr 


Val 


Glu 


Cys 


Ser 


Ala 


Leu 






155 










160 


Phe 


Asp 


Glu 


Ala 


He 


Leu 


Ala 


Ala 




170 










175 




Lys 


Arg 


Lys 


Cys 


Cys 


He 


Phe 




185 










190 






gtt 


ggt 


gac 


gga 


gec 


gtc 


ggt 


aaa 


Val 


Gly 


Asp Gly 


Ala 


Val 


Gly 


Lys 
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15 10 15 

acq tgt etc ctg ata tec tac acc aca aac gca ttt ccc gga gaa tat 96 

Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

att ccg acg gta ttc gac aac tac tea gca aat gtg atg gtc gac ggt 144 
lie Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

egg ccg ata aat etc ggg etc tgg gat aca get gga cag gaa gat tac 192 
Arg Pro lie Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gat cga etc cga cca ctg tea tat cca caa aca gac gtg ttt etc gta 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65- ~ 70 75 80 

tgc ttt gee ctg aac aat ccg gcg agt ttt gag aat gtt cgt gcg aaa 288 
Cys Phe Ala Leu Asn Asn Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

tgg tat cca gaa gtg tea cat cat tgc ccg aat acg ccg att att ttg 336 
Trp Tyr Pro Glu Val Ser His "His Cys Pro Asn Thr Pro lie lie Leu 

100 105 HO 

gtt gga acg aaa get gat ctg cgt gag gat cga gat act gtt gaa egg 384 
Val Gly Thr Lys Ala Asp Leu Arg Glu Asp Arg Asp Thr Val Glu Arg 

115 120 125 

etc cgc gaa cgc egg etc caa cca gtg age caa acc cag ggc tac gtg 432 
Leu Arg Glu Arg Arg Leu Gin Pro Val Ser Gin Thr Gin Gly Tyr Val 

130 135 140 

atg gca aag gaa ate aag get gtc aag tat ctg gag tgc teg gcg etc 480 
Met Ala Lys Glu He Lys Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

acg caa cgt ggt ctg aaa caa gtt ttc gat gag gcg ate cga gee gtc 528 
Thr Gin Arg Gly Leu Lys Gin Val Phe Asp Glu Ala He Arg Ala Val 
165 170 175 

* gtc acg ccg cca caa aga gee aaa aag age aag tgt acg gtg etc 573 
Val Thr Pro Pro Gin Arg Ala Lys Lys Ser Lys Cys Thr Val Leu 

taa 



<210> 80 
<211> 191 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 80 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

15 10 I s 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

Arg Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 . 80 

Cys Phe Ala Leu Asn Asn Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

* 85 90 95 

Trc> Tvr Pro Glu Val Ser His His Cys Pro Asn Thr Pro He He Leu 

100 105 HO 

Val Gly Thr Lys Ala Asp Leu Arg Glu Asp Arg Asp Thr Val Glu Arg 

115 120 125 

Leu Arg Glu Arg Arg Leu Gin Pro Val Ser Gin Thr Gin Gly Tyr Val 

130 135 140 

Met Ala Lys Glu He Lys Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Arg Gly Leu Lys Gin Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

Val Thr Pro Pro Gin Arg Ala Lys Lys Ser Lys Cys Thr Val Leu 
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180 
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185 
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190 



<210> 81 
<211> 579 
<212> DNA 

<213> Caenor habdi t i 3 elegans 

<220> 

<221> CDS 

<222> (1) . . (579) 

<400> 81 

atg get gcg att aga aag aag ttg gtg ate gtc ggc gac gga gee tgc 48 
Met Ala Ala lie Arg Lys Lys Leu Val lie Val Gly Asp Gly Ala Cys 
1.5 10 15 
ggt aaa act tgc ctg etc ate gtt ttc tea aag gat cag ttc cca gac 96 
Gly Lys Thr Cys Leu Leu lie Val Phe Ser Lys Asp Gin Phe Pro Asp 
- - 20 - ' 25 ^ 30 

gtg tat gtg ccg act gtt ttc gag aat tat gtt gee gac att gaa gtt 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp lie Glu Val 

35 40 45 

gac gga aag cag gtc gaa ctt get eta tgg gat aca get gga caa gag 192 
Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gac tat gat cgt ctg cgt cca etc tct tat ccg gat act gac gtc att 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val lie 
65 70 75 80 

ctg atg tgc ttc tea att gat tea ccc gat tea ctg gag aat att cca 288 
Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gaa aaa tgg aca cca gaa gtc agg cat ttc tgt cca aat gtt ccg att 336 
Glu Lys Trp Thr Pro Glu Val Arg . His Phe Cys Pro Asn Val Pro He 

100 105 110 

att ttg gtc gga aat aaa cga gat ctt cgt agt gat cca cag act gtt 384 
He Leu Val Gly Asn Lys Arg Asp Leu Arg Ser Asp Pro Gin Thr Val 

115 - 120 125 

cga gaa ttg gca aaa atg aaa cag gag ccc gtg aag cct gag cag gga 432 
Arg Glu Leu Ala Lys Met Lys Gin Glu Pro Val Lys Pro Glu Gin Gly 

130 135 140 

cga gca att get gag caa att gga gca ttt gca tat ttg gag tgc tct 480 
Arg Ala He Ala Glu Gin He Gly Ala Phe Ala Tyr Leu Glu Cys Ser 
145 150 155 160 

gcg aag act aag gac gga att cgt gag gta ttc gag aag gcg aca cag 528 
Ala Lys Thr Lys Asp Gly He Arg Glu Val Phe Glu Lys Ala Thr Gin 

165 170 175 

gcg gca ttg caa cag aag aag aaa aag aag age aag tgc atg att ttg 576 
Ala Ala Leu Gin Gin Lys Lys Lys Lys Lys Ser Lys Cys Met He Leu 

180 185 ^ 190 

taa 579 



<210> 82 
<211> 192 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 82 

Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Asp 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 . 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
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65 










70 










75 








80 


Leu 


Met 


/*"*-. m 

cys 


Fne 




lie 


Asp 


Cay* 

ber 


Pro 


Asp 


Car T.Ali 
OCX JjcU 


Glu 


Asn 


lie 


Pro 


















90 








95 




Glu 


Lys 


Trp 


Thr 


Pro 


uxu 


vaj. 


Arg 


His 


Phe 


cys rXU 


Asn Val 


Pro 


lie 




10U 










105 








110 






lie 


Leu 


vai 

lid 


Gly 


Asn 


Lys 


Arg 


Asp 
ion 

X A U 


Leu 


Arg 


Gay* A an 


Pro 
125 


Gin 


Thr 


Val 


Arg 


Glu 


Leu 


Ala 


Lys 


Met 


Lys 


Gin 


Glu 


Pro 


vai iiys 


Pro 


Glu 


Gin Gly 


130 










135 








140 










Arg 


Ala 


lie 


Aia 


G1U 


bin 


lie 


biy 


Ala 


Phe 


Ala iyi 


Leu 


Glu 


Cys 


Ser 


145 










150 










155 








160 


Ala 


Lys 


Thr 


Lys 


Asp. 


Gly 


lie 


Arg 


Glu 


Val 


Phe Glu 


Lys 


Ala 


Thr 


Gin 








165 










170 








175 




Ala 


Ala 


Leu 


Gin 
180 


Gin 


Lys 


Lys 


Lys 


Lys 
185 


Lys 


Ser Lys 


Cys 


Met 
190 


lie 


Leu 



<210> 83 

<211> 928 * 
<212> DNA 

<213> Schizosaccharomyces pombe 

<220> 

<221> CDS 

<222> (55) . . (633) 

<400> 83 

gaattcaata aagtgaagca aagctttacg attaattatt ttttgtgaaa tagt atg 57 
~ ~ ~ Met 

1 

ccc acc att aag tgt gtc gta gta gga gac ggt yet gta gga aag acc 105 
Pro Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys Thr 

5 10 15 ■ 

tgt ctg ctt att tec tat act aca aac aag ttt cct agt gac tat gtg 153 
Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Asp Tyr Val 

20 25 30 

cca act gta ttc gat aat tat get gtc act gtc atg ate ggt gat gaa 201 
Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Asp Glu 

35 40 45 

cca tac act ctt ggt tta ttc gat acc get ggt cag gag gat tat gat 249 
Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr Asp 
50 ~ ^ 55 60 65 

cgc ttg cgt cct tta tec tat cct caa aca gac gtc ttt ttg gtt tgt 297 
Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val Cys 

70 75 80 

ttt agt gta act tct cct gec agt ttt gaa aat gtg aaa gaa aag tgg 345 
Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Lys Glu Lys Trp 

85 90 95 

ttt ccc gaa gtt cat cat cat tgt ccg ggc gtc ccg tgc tta att gtt 393 
Phe Pro Glu Val His His His Cys Pro Gly Val Pro Cys Leu He Val 

100 105 110 

ggt acc caa att gat tta cgt gat gac cct tct gtg caa cag aaa eta 441 
Gly Thr Gin He Asp Leu Arg Asp Asp Pro Ser Val Gin Gin Lys Leu 

115 *" 120 125 

get cgc cag cat cag cat ccc ctt aca cat gag Cqia ggt gaa cga tta 489 
Ala Arg Gin His Gin His Pro Leu Thr His Glu Gin Gly Glu Arg Leu 
130 135 140 145 

get cgt gag ttg ggt get gtc aag tat gtt gag tgt tec get ttg acc 537 
Ala Arg Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu Thr 

150 155 • 160 

caa aaa ggg tta aag aat gtt ttt gat gaa get att gta gee get ctt 585 
Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Ala Ala Leu 

165 170 175 

gat cct cct gtt cct cac aag aaa aag tea aag tgt ttg gta ctg 630 
Asp Pro Pro Val Pro His Lys Lys Lys Ser Lys Cys Leu Val Leu 

180 185 190 

taatttaacc ctggttttct ttccttttcc gaaatttcct taatacagct agtttacaac 690 
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agtaaacaat gtcttagctt tctttaacga tccctcatat ttgcaaagat atttgctatt 750 
tgttattttc atcctagctt ttgtgcttct tcattcattc tgtcaatcac acacttcttg 810 
ggctttacat atctgtgttt tctggctagg catatttcct ttttccgtta aataaacgac 870 
gattttgcta tacttttgct acattattgt attcttctgc cattactcgc aggaattc 928 



<210> 84 
<211> 192 
<212> PRT 

<213> Schizosaccharomyces- pombe 
<400> 84 

Met Pro Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Asp Tyr 

20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Asp 

35 40 45 

Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 - 80 

Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Lys Glu Lys 

*85 90 95 

Trp Phe Pro Glu Val His His His Cys Pro Gly Val Pro Cys Leu He 

100 105 no 

Val Gly Thr Gin He Asp Leu Arg Asp Asp Pro Ser Val Gin Gin Lys 

115 120 125 

Leu Ala Arg Gin His Gin His Pro Leu Thr His Glu Gin Gly Glu Arg 

130 135 140 

Leu Ala Arg Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 150 155 4 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Ala Ala 

165 170 175 

Leu Asp Pro Pro Val Pro His Lys Lys Lys Ser Lys Cys Leu Val Leu 
180 185 190 

<210> 85 
<211> 579 ' 
<212> DNA 

<213> Mus musculus (house mouse) 

<220> 

<221> CDS 

<222> (1) .". (579) 

<400> 85 

atg cag gcc ate aag tgt gtg gtg gtg ggt gat gga gec gtg ggc aag 48 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

acg tgt ctt etc ate age tac ace ace aac gcc ttc cct gga gaa tac 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

ate ccc act gta ttt gac aac tac tea gcc aat gtg atg gtg gac agt 144 
lie Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ser 

35 40 45 

aag ccg gtg aac ctg ggg ctg tgg gat ace gca ggt cag gag gac tat 192 
Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gac cgc etc egg cca etc tec tac cca cag aca gat gta ttt etc ate 240 
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Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


65 










70 


tgc 


ttc 


teg 


eta 


gtc 


age 


Cys 


Phe 


Ser 


Leu 


Val 


Ser 








85 




tgg 


ttc 


cct 


gag 


gta 


egg 


Trp 


Phe 


Pro 


Glu 
100 


Val 


Arg 


gtg 


ggt 


acc 


aag 


ctg 


gac 


Val 


Gly 


Thr 


Lys 


Leu Asp 






115 








ctg 


aag 


gag 


aag 


aag 


ctg 


Leu 


Lys 
13 0 


Glu 


Lys 


Lys 


Leu 


ctg 


gec 


aag 


gat 


att 


gat 


Leu 


Ala 


Lys 


Asp 


He 


Asp 


145 










150 


acc 


cag 


cga 


ggc 


ctg aag 


Thx 


Gin 


Arg 


Gly 


Leu Lys 










165 




etc 


tgc 


cca 


cag 


ccc 


aca 


Leu 


Cys 


Pro 


Gin 


Pro 


Thr 






180 






tag 
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Ser 


Tyr Pro 


Gin 


Thr Asp 








75 




cca 


gec tec 


tat 


gag 


aat 


Pro 


Ala Ser 


Tyr 


Glu Asn 






90 






cac 


cac tgc 


ccc 


age 


acc 


His 


His Cys 


Pro 


Ser 


Thr 




105 








ctt 


cgc gat 


gac 


aag 


gac 


Leu 


Arg Asp Asp 


Lys 


Asp 




120 








get 


ccc ate 


acc 


tac 


ccg 


Ala 


Pro He 


Thr 


Tyr 


Pro 


135 








140 


tea 


gtc aag 


tac 


ttg 


gaa 


Ser 


Val Lys 


Tyr 


Leu 


Glu 








155 




acc 


gtc ttc gat 


gag 


gca 


Thr 


Val Phe 


Asp 


Glu 


Ala 






170 






cga 


cag cag 


aag 


cgc 


ccc 


Arg 


Gin Gin 


Lys 


Arg Pro 




185 
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Val Phe Leu He 
. 80 

gtc cgt gee aag 288 
Val Arg Ala Lys 
95 

ccc ate ate ctg 336 
Pro He He Leu 
110 

acc ate gag aag 384 

Thr He Glu Lys 

125 

cag ggc ctg gca 432 
Gin Gly Leu Ala 

tgt tct gca etc 480 
Cys Ser Ala Leu 
160 

ate cgc gca gtc 52 8 

He Arg Ala Val 
175 

tgc age ctg etc 576 
Cys Ser Leu Leu 
190 

579 



<210> 86 
<211> 192 
<212> PRT 

<213> Mus musculus (house mouse) 
<400> 86 



Met 


Gin 


Ala 


He 


Lys 


Cys 


Val 


Val 


Val 


Gly 


Asp Gly Ala Val Gly Lys 


1 








5 










10 


15 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Thr 


Asn 


Ala Phe Pro Gly Glu Tyr 






20 










25 




30 


He 


Pro 


Thr 


Val 


Phe 


Asp Asn 


Tyr 


Ser 


Ala 


Asn Val Met Val Asp Ser 






35 










40 






45 


Lys 


Pro 


Val 


Asn 


Leu 


Gly Leu 


Trp Asp 


Thr 


Ala Gly Gin Glu Asp Tyr 


50 










55 








60 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin 


Thr Asp Val Phe Leu He 


65 








70 










75 80 


Cys 


Phe 


Ser 


Leu 


Val 


Ser 


Pro 


Ala 


Ser Tyr 


Glu Asn Val Arg Ala Lys 








85 










90 


95 


Trp 


Phe 


Pro 


Glu 


Val 


Arg 


His 


His 


Cys 


Pro 


Ser Thr Pro He He Leu 






100 








105 




110 


val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg Asp 


Asp 


Lys Asp Thr He Glu Lys 






115 










120 






125 


Leu 


Lys 


Glu 


Lys 


Lys 


Leu 


Ala 


Pro 


He 


Thr 


Tyr Pro Gin Gly Leu Ala 




130 








135 








140 


Leu 


Ala 


Lys 


Asp 


He 


Asp 


Ser 


Val 


Lys 


Tyr 


Leu Glu Cys Ser Ala Leu 


145 










150 










155 160 


Thr 


Gin 


Arg 


Gly 


Leu 


Lys 


Thr 


Val 


Phe 


Asp 


Glu Ala He Arg Ala Val 










165 










170 


175 


Leu 


Cys 


Pro 


Gin 


Pro 


Thr Arg 


Gin 


Gin 


Lys 


Arg Pro Cys Ser Leu Leu 






180 










185 




190 



<210> 87 
<211> 840 
<212> DNA 

<213> Discopyge ommata 

<220> 

<221> CDS 

<222> (97) . . (675) 
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<400> 87 

gaccggcgcg ccattggacg gtcgctaggc ggaggcggaa gggattggtt gccgtgcgac 60 



gtcagtcagg tgcttcagga gcgacggggc acagcg 


atg 


gca 


gcg 


ate 


egg 


aag 


114 






















Met 
1 


Ala 


Ala 


lie 


Arg 
5 


•Lys 




aag 


etc 


gtc 


ate 


gtt 


ggc 


gat 


gga 


get 


tgt 


ggc 


aag 


acc 


tgc 


etc 


ctg 


162 


Lys 


Leu 


Val 


He 


Val 


Gly 


Asp 


Gly 


Ala 


Cys 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 










10 










15 










20 








ate 


gtc 


ttc 


age 


aag 


gac 


cag 


ttc 


ccg 


gag 


gtc 


tac 


gtc 


ccc 


act 


gtc 


210 


He Val 


Phe 


Ser 


Lys 


Asp 


Gin 


Phe 


Pro 


Glu 


val 


Tyr 


Val 


Pro 


Thr 


Val 








25 










30 










35 










ttc 


gag 


aac 


tac 


gtg 


gca 


gac 


ate 


gag 


gtg 


gac 


ggg 


aag 


cag 


gtg 


gag 


258 


Phe 


Glu 


Asn 


Tyr 


val 


Ala 


Asp 


lie 


Glu 


Val 


Asp 


Gly 


Lys 


Gin 


Val 


Glu 






40 










45 










50 












etc 


get 


etc 


tgg 


gac 


acg 


get 


gga 


cag 


gag 


gac 


tat 


gac 


cgc 


ctg 


cga 


306 


Leu Ala 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 




55 










60 










65 










70 




ccc 


etc 


tec 


tac 


cct 


gac 


acc 


gac 


gtc 


att 


ctg 


atg 


tgc 


ttt 


tct 


ate 


354 


Pro 


Leu 


Ser 


Tyr 


Pro 


Asp 


Thr 


Asp 


Val 


He 


Leu 


Met 


Cys 


Phe 


Ser 


He 












75 










80 










85 






gac 


age 


ccc 


gac 


age 


ttg 


gag 


aac 


att 


ccg 


gag 


aag 


tgg 


acc 


ccg 


gag 


402 


Asp 


Ser 


Pro 


Asp 


Ser 


Leu 


Glu 


Asn 


He 


Pro 


Glu 


Lys 


Trp 


Thr 


Pro 


Glu 










90 










95 










100 








gtg 


aag 


cac 


ttc 


tgc 


ccc 


aac 


ate 


ccc 


ate 


ate 


ctg 


gtg 


ggc 


aac 


aag 


450 


Val 


Lys 


His 


Phe 


Cys 


Pro 


Asn 


He 


Pro 


He 


He 


Leu 


Val 


Gly 


Asn 


Lys 








105 










110 










115 








aag 


act 


gcg 


ggg 


gac 


gag 


cac 


aca 


egg 


agg 


gag 


eta 


gee 


aag 


atg 


aag 


498 


Lys 


Thr 


Ala 


Gly 


Asp 


Glu 


His 


Thr 


Arg 


Arg 


Glu 


Leu 


Ala 


Lys 


Met 


Lys 






12 0 










125 










130 












cag 


gag 


cct 


gtc 


aaa 


cca 


gat 


gat 


gee 


aag 


gag 


atg 


ggg 


age 


cgc 


ate 


546 


Gin 


Glu 


Pro 


Val 


Lys 


Pro 


Asp 


Asp 


Ala 


Lys 


Glu 


Met 


Gly 


Ser 


Arg 


He 




135 










140 










145 










150 




aaa 


gec 


ttt 


ggc 


tac 


ctg 


gaa 


tgc 


tea 


gcg 


aag 


acg 


aag 


gaa 


ggc 


gtg 


594 


Lys 


Ala 


Phe 


Gly 


Tyr 


Leu 


Glu 


Cys 


Ser 


Ala 


Lys 


Thr 


Lys 


Glu 


Gly 


Val 












155 










160 










165 






cgc 


gag 


gtc 


ttt 


gag 


ctg 


gee 


teg 


cgc 


get 


get 


ctg 


cag 


gee 


aag 


aaa 


642 


Arg 


Glu 


Val 


Phe 


Glu 


Leu 


Ala 


Ser 


Arg 


Ala 


Ala 


Leu 


Gin 


Ala 


Lys 


Lys 










170 










175 










180 






acc 


aag 


age 


aag 


age 


ccc 


tgt 


ctg 


ctt 


ctg 


tgagcaggag ctcccagcta 


692 


Thr 


Lys 


Ser 


Lys 


Ser 


Pro 


Cys 


Leu 


Leu 


Leu 

















185 190 

tccccacttc tctctgcttc tgctcactct ccgtcctccc cctctcccat tccttctctt 752 

tcctccctcg ttcttccctc tcttgtttct ccccatctct ctcctccctc tegttcttte 812 

ctctcctctt gttttcctcc ttgagege 840 



<210> 88 
<211> 192 
<212> PRT 

<213> Discopyge ommata 
<400> 88 

Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 * 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 
50 55 60 
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Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser Tyr Pro Asp Thr Asp Val He 


65 








70 






75 80 


Leu 


Met 


Cys 


Phe 


Ser 


lie 


Asp 


Ser 


Pro Asp Ser Leu Glu Asn He Pro 








85 








90 95 


Glu 


Lvs 


Trp 


Thr 


Pro 


Glu 


Val 


Lys 


His Phe Cys Pro Asn He Pro He 




100 










105 HO 


lie 


Leu 


Val 


Gly 


Asn 


Lys 


Lys 


Thr 


Ala Gly Asp. Glu His Thr Arg Arg 






115 










120 


125 


Glu 


Leu 


Ala 


Lys 


Met 


Lys 


Gin 


Glu 


Pro Val Lys Pro Asp Asp Ala Lys 




130 








135 




140 


Glu 


Met 


Gly Ser 


Arg 


He 


Lys 


Ala 


Phe Gly Tyr Leu Glu Cys Ser Ala 


145 








150 






155 160 


Lys 


Thr 


Lys 


Glu 


Gly 


Val 


Arg 


Glu 


Val Phe Glu Leu Ala Ser Arg Ala 






165 








170 175 


Ala 


Leu 


Gin 


Ala 


Lys 


Lys 


Thr 


Lys 


Ser Lys Ser Pro Cys Leu Leu Leu 



180 185 190 



<210> 89 
<211> 697 
<212> DNA 

<213> Canis lupus familiaris (dog) 

<220> 

<221> CDS 

<222> (1) . . (582) 

<400> 89 

atg get gec ate egg aag aaa ctg gtg qitt gtt ggt gat gga gee tgt 48 
Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 
15 10 15 

ggt aag act tgt ttg etc ate gtc ttt age aag gac cag ttc cca gag 96 
Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtg tat gta ccc aca gtg ttt gag aac tat gtg gca gat att gaa gtt 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

gat gga aag cag gta gag ttg get ttg tgg gat aca get ggg cag gaa 192 
Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tat gat cgc ttg agg cct etc tec tat cca gac act gat gtt ata 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 ~ 70 75 80 

ctg atg tgt ttc tct att gac age cct gat age tta gaa aac ate cca 288 
Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gaa aaa tgg ace cca gaa gtc aag cac ttc tgt ccc aac gtg ccc ate - 336 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 110 

ate ctg gtt ggg aac aag aag gat ctt egg aat gat gag cac aca agg 384 
He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 120 125 

egg gag eta gee aag atg aag cag gag ccg gtg aaa ccg aca gaa ggc 432 
Arg Glu Leu Ala Lys Met Lys Gin Glu Pro Val Lys Pro Thr Glu Gly 

130 135 140 

aga gat atg gca aac agg att ggt get ttt ggg tac atg gag tgt tea 480 
Arg Asp Met Ala Asn Arg He Gly Ala Phe Gly Tyr Met Glu Cys Ser 
145 ' 150 155 160 

gca aag ace aaa gat gga gtg agg gag gtt ttt gaa atg gee acg aga 528 
Ala Lys Thr Lys Asp Gly Val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 ~ 170 175 

get get ctg caa gee aga cgt ggg aag aaa aaa tct ggg tgc ctt gtc 576 
Ala Ala Leu Gin Ala Arg Arg Gly Lys Lys Lys Ser Gly Cys Leu Val 

180 " 185 190 

ttg tgaaaccctg ctgcaagcac agccctcatg cggttaattt tgaagtgctg 629 
Leu 



tttattaatc ttagtgtatg attactggee tttttcattt atctataatt tacctaagat 689 



WO 2005/014828 



77/291 



PCT7EP2004/008136 



tacaaatc 697 



<210> 90 
<211> 193 
<212> PRT 

<213> Canis lupus familiaris (dog) 
<400> 90 

Met Ala Ala lie Arg Lys Lys Leu Val lie Val Gly Asp Gly Ala Cys 

.1 5 10 15 

Gly Lys Thr Cys Leu Leu lie Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp lie Glu Val 

35- 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val lie 
65 70 75 ~ 80 

Leu Met Cys Phe Ser lie Asp Ser Pro Asp Ser Leu Glu Asn lie Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro lie 

100 105 ~ no 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 120 125 

Arg Glu Leu Ala Lys Met Lys Gin Glu Pro Val Lys Pro Thr Glu Gly 

130 135 140 

Arg Asp Met Ala Asn Arg He Gly Ala Phe Gly Tyr Met Glu Cys Ser 
145 150 155 160 

Ala Lys Thr Lys Asp Gly Val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Ala Arg Arg Gly Lys Lys Lys Ser -Gly Cys Leu Val 
180 185 190 

Leu 



<210> 91 
<211> 705 
<2l2> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (25) . . (603) 



<400> 91 

tagcactcag tcagaaatcg aacc atg cag gcg ate aag tgc gtc gtc gtg 51 

Met Gin Ala He Lys Cys Val Val Val 
1 5 



ggc 


gac 


gga 


gec 


gtg 


gga 


aag 


acc 


tgc 


ctg 


ctg ate 


age 


tac 


acg 


acc 


99 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu He 


Ser 


Tyr 


Thr 


Thr 




10 










15 










20 








25 




aat 


gec 


ttt 


ccc 


ggc 


gag 


tac 


ata 


ccc 


acc 


gtg ttc 


gac 


aac 


tac 


teg 


147 


Asn 


Ala 


Phe 


Pro 


Gly 
30 


Glu 


Tyr 


He 


Pro 


Thr 
35 


Val Phe 


Asp 


Asn 


Tyr 
40 


Ser 




gec 


aac 


gtg 


atg 


gtg 


gac 


gec 


aag 


ccc 


ate 


aac ctg 


ggc 


ctg 


tgg 


gat 


195 


Ala 


Asn 


Val 


Met 


Val 


Asp 


Ala 


Lys 


Pro 


He 


Asn Leu 


Gly 


Leu 


Trp 


Asp 










45 










50 








55 






acg 


gec 


ggg 


cag 


gag 


gac 


tac 


gac 


cga 


ctg 


agg cca 


ctg 


tct 


tat 


ccc 


243 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg Pro 


Leu 


Ser 


Tyr 


Pro 








60 










65 








70 








cag 


acc 


gat 


gtc 


ttc 


etc 


ate 


tgc 


ttc 


teg 


ctg gtg 


aat 


ccg 


gca 


teg 


291 


Gin 


Thr 
75 


Asp 


val 


Phe 


Leu 


He 
80 


Cys 


Phe 


Ser 


Leu Val 
85 


Asn 


Pro 


Ala 


Ser 




ttc 


gag 


aac 


gtg 


egg 


gee 


aag 


tgg 


tat 


ccg 


gag gtg 


cgc 


cac 


cac 


tgc 


339 
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Phe Glu Asn Val Arg Ala Lys Trp Tyr Pro Glu Val Arg His His Cys 

90 95 100 105 

ccc age acg ccc ate ate ctg gtg ggc acc aag ctg gat ttg cgc gac 387 

Pro Ser Thr Pro lie lie Leu Val Gly Thr Lys Leu Asp Leu Arg Asp 

110 115 120 

gac aag aac aca ate gaa aag ctg agg gac aag aaa ctg gtg ccc ate 435 
Asp Lys Asn Thr He Glu Lys Leu Arg Asp Lys Lys Leu Val Pro He 

125 130 135 

acc tat ccg cag ggc ctg gec atg gee aag gaa ate gga gcg gtc aag 483 
Thr Tyr Pro Gin Gly Leu Ala Met Ala Lys' Glu He Gly Ala Val Lys 

140 145 150 

tat ctg gag tgc teg gee ctg acg cag aag ggt ctg aaa acc gtt ttc 531 
Tyr Leu Glu Cys Ser Ala Leu Thr Gin Lys Gly Leu Lys Thr Val Phe 

155 160 165 

gac gag gec ate egg teg gtt ttg tgc ccc gtg ctg cag ccc aag tec 579 
Asp Glu Ala He Arg Ser Val Leu Cys Pro Val Leu Gin Pro Lys Ser 
170 175 180 185 

aag cgc aag tgc gee ctg etc* taaaagagtt agacagttgg tcggtggtgt 630 
Lys Arg Lys Cys Ala Leu Leu 

190 

tccaccgcag cgactcctgc ttaaaegcta cactactcta ctctaaccta taataaccat 690 
tatgtaacct aagee 705 



<210> 92 
<211> 192 
<212> PRT 

<213> Drosophila melanogaster 
<400> 92 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ala 

35 ~ 40 45 

Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 • 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 " 70 75 80 

Cys Phe Ser Leu Val Asn Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

Trp Tyr Pro Glu Val Arg' His His Cys Pro Ser Thr Pro He He Leu 

100 105 110 

Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asn Thr He Glu Lys 

115 * 120 125 

Leu Arg Asp Lys Lys Leu Val Pro He Thr Tyr Pro Gin Gly Leu Ala 

130 135 140 

Met Ala Lys Glu He Gly Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 ' 150 155 160 

Thr Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ser Val 

165 170 175 

Leu Cys Pro Val Leu Gin Pro Lys Ser Lys Arg Lys Cys Ala Leu Leu 
180 185 190 

<210> 93 
<211> 801 
<212> DNA 

<213> Drosophila melanogaster 



<220> 

<221> CDS 

<222> (160) . . (735) 
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<400> 93 

ctcgcacgca aagagctcgc tggagagaga gaacgctaac acaaaaggaa atagtagttt 60 
tccgtacgta tcgagcccag cccagaaaag cggaaaacta aacggattcg acagcggttg 120 



caaattttaa ataactgagt catcacccct ctgcacaac atg caa acc ate aag 174 

Met Gin Thr He Lys 
1 5 



tgc 


gtg gtc 


gtc 


ggc gac 


gga 


gee 


gtg 


ggt aag 


aca 


tgc 


ctg 


etc 


ate 


222 


Cvs 


Val Val 


Val 


Glv Asd 


Glv 


Ala 


Val 


Gly Lys 


Thr 




iJCU 


Leu 


lie 










10 








15 












tccr 


tat aca 


acc 


aac aac 


ttc 


ccg 


teg 


era cr hap 


yty 


CCC 


acg 


gtg 


cue 


Z1Q 


Ser 


Tyr Thr Thr 


Asn Lvs 


Phe 


Pro 






Val 


Pro 


inr 


vai. 


rile 








25 








3 0 








«3> -J 








crac 


aac tat 


gcg 


gtc act 


erfcer 
y *-y 


atg 


ate 


ggc ggt 


gag 


CCC 


tac 


aca 


ctg 


318 


Asp 


Asn Tyr Ala 


Val "Thr 


Val 


Met 




wXjr OXjr 




Pro 


Tyr 


Thr 


Leu 






40 








45 
















crcrc 


ctg ttc 


gat 


gLCg gee 


crcra. 
yyo. 


cag 


gag 




rra t* 
y cll. 


egg 


ctg 


cgt 


ccg 


*a c c 


Glv 


Leu Phe Asp 


Thr Ala 




Gin 


Glu 


Hsp lyr 


Asp 


Arg 


Leu 


Arg 


Pro 






55 






60 








o ~> 










etc 


"tec tat 


ccg 




gat 


gtc 


ttc 


v— u u y 


tgc 


t uu 


teg 


gtg 


gtc 


414 


Leu 


Ser Tyr 


Pro 


Gin Thr 


Asp 


Val 


Phe 


J-ltS LL V CL J. 


Cys 


riie 


Cor 


Val 


Val 




70 






75 








SO 








85 




agt 


ccc agt 


tec 


ttc gag 


aac 


gtc 


aag 


gag aag 


tgg 


gtg 


CCC 


gag 


att 


462 


Ser 


Pro Ser 


Ser 


w JL LL 


A an 
noli 


Val 


Lys 


uiu iiys 
95 


Trp 


vax 


Pro 


Glu 
100 


He 




aca 


cac cat 


tgc 


caa aag 


acg 


ccg 


ttc 


ctg ctg 


gtg 


ggc 


aca 


cag 


att 


510 


Thr 


His His 


Cys 


^ST n T ire 
bill LyS 


X XJJl 


Pro 


Phe 


Leu Leu 


val 


Gly 


Thr 


Gin 


He 








105 








110 






115 








gat 


ttg cgc 


gac 


gag aac 


age 


acg 


ctg 


gag aag 


ctg 


gee 


aag 


aac. 


aag 


558 


Asp 


Leu Arg Asp 


Glu Asn 


Ser 


Thr 


Leu 


Glu Lys 


Leu 


Ala 


Lys 


Asn 


Lys 






120 








125 








130 








cag 


aag ccc 


ate 


acc atg 


gag 


cag 


ggc 


gag aag 


ctg 


gee 


aag 


gag 


ctg 


606 


Gin 


Lys Pro 


He 


Thr Met 


Glu 


Gin Gly 


Glu Lys 


Leu 


Ala 


Lys 


Glu 


Leu 






135 






140 








145 










aag 


gec gtc ^aag 


tac gtg 


gag 


tgc 


teg 


gec ttg 


aca 


cag 


aag 


ggc 


ctg 


654 


Lys Ala Val 


Lys 


Tyr Val 


Glu 


Cys 


Ser 


Ala Leu 


Thr 


Gin 


Lys 


Gly Leu 




150 






155 








160 








165 




aaa 


aat gta 


ttc 


gac gag 


gec 


ate 


ctg 


gec gee 


etc 


gag 


cca 


cca 


gag 


702 


Lys 


Asn Val 


Phe 


Asp Glu 
170 


Ala 


lie 


Leu 


Ala Ala 
175 


Leu 


Glu 


Pro 


Pro 
180 


Glu 




ccc 


aca aag 


aaa 


agg aag 


tgc 


aaa 


ttc 


tta taattctget ttcctcacac 


752 


Pro 


Thr Lys 


Lys 
185 


Arg Lys 


Cys 


Lys 


Phe 
190 


Leu 















gcggtatccc tttcattctg gettfetact ategcaatga ctccttccc - 801 



<210> 94 
<211> 191 
<212> PRT 

<213> Drosophila melanogaster 



<400> 94 



Met 


Gin Thr He Lys 


Cys 


Val 


Val 


Val 


Gly Asp Gly Ala Val 


Gly Lys 


i 


5 










10 


15 


Thr 


Cys Leu Leu He 


Ser 


Tyr 


Thr 


Thr 


Asn Lys Phe Pro Ser 


Glu Tyr 




20 








25 


30 


Val 


Pro Thr Val Phe 


Asp 


Asn 


Tyr 


Ala 


Val Thr Val Met lie 


Gly Gly 




35 






40 




45 


Glu 


Pro Tyr Thr Leu 


Gly 


Leu 


Phe 


Asp 


Thr Ala Gly Gin Glu 


Asp Tyr 




50 




55 






60 


Asp Arg Leu Arg Pro 


Leu 


Ser 


Tyr 


Pro 


Gin Thr Asp Val Phe 


Leu Val 


65 




70 








75 


80 


Cys 


Phe Ser Val Val 


Ser 


Pro 


Ser 


Ser 


Phe Glu Asn Val Lys 


Glu Lys 
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85 90 95 

Trp Val Pro Glu He Thr His His Cys Gin Lys Thr Pro Phe Leu Leu 

100 105 110 

Val Gly Thr Gin He Asp Leu Arg Asp Glu Asn Ser Thr Leu Glu Lys 

115 120 125 

Leu Ala Lys Asn Lys Gin Lys Pro He Thr Met Glu Gin Gly Glu Lys 

13 0 ~ 135 140 

Leu Ala Lys Glu Leu Lys Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 150 . 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Leu Ala Ala 

165 170 175 

Leu Glu Pro Pro Glu Pro Thr Lys Lys Arg Lys Cys Lys Phe Leu 
180 185 190 



<210> 95 
<211> 609 
<212> DNA 

<213> Schizosaccharomyces pombe 



<220> 
<221> CDS 
<222> (1) . 



(609) 



<400> 95 
atg gcg aca 
Met Ala Thr 
1 

tgt ggt aaa 
Cys Gly Lys 

gag gtc tat 
Glu Val Tyr 
35 

gtt gat gga 
Val Asp Gly 
50 

gaa gat tac 
Glu Asp Tyr 
65 

ate ctt att 
He Leu He 

caa gaa aaa 
Gin Glu Lys 



att ttg 
He Leu 



att 
He 

ggt 
Gly 
145 
tct 
Ser 



gag 
Glu 
130 
caa 
Gin 

gec 
Ala 



ctt 
Leu 
115 
gag 
Glu 

gca 
Ala 

aag 

Lys' 



gaa ctt 
Glu Leu 
5 

aca tgc 
Thr Cys 
20 

gtt ccc 
Val Pro 

cgc cac 
Arg His 

gac cgt 
Asp Arg 

tgc ttt 
Cys Phe 

85 
tgg att 
Trp He 
100 

gtc get 
Val Ala 



cgt get get 
Arg Ala Ala 

act aag aag 
Thr Lys Lys 
195 



tta 
Leu 

gta 
Val 

acg 
Thr 

atg 
Met 
180 
aag 
Lys 



tec 
Ser 

get 
Ala 

aat 
Asn 
165 
etc 
Leu 



cgc aga 
Arg Arg 

tta tta 
Leu Leu 

act gtt 
Thr Val 

gtt gag 
Val Glu 

55 
eta cgt 
Leu Arg 
70 

get gtt 
Ala Val 

tec gag 
Ser Glu 

tgc aag 
Cys Lys 

aag act 
Lys Thr 
135 
cag aag 
Gin Lys 
150 

gaa ggt 
Glu Gly 

aaa cac 
Lys His 



aag cgt tgt 
Lys Arg Cys 



aag 
Lys 

att 
He 

ttt 
Phe 
40 
ttg 
Leu 

ccc 
Pro 

gat 
Asp 

gtt 
Val 

get 
Ala 
120 
aat 
Asn 

att 
He 

gtt 
Val 

aag 
Lys 

ate 
He 
200 



etc gtt 
Leu Val 

10 
gta ttt 
Val Phe 
25 

gaa aat 
Glu Asn 



get 
Ala 

ttg 
Leu 

tct 
Ser 

etc 
Leu 
105 
gat 
Asp 

cag 
Gin 

ggt 
Gly 

cgt 
Arg 

ccc 
Pro 
185 
ttg 
Leu 



ctt 
Leu 

tea 
Ser 

ccc 

Pro 

90 

cat 

His 



att gtg 
He Val 

tct aaa 
Ser Lys 

tat gta 
Tyr Val 

tgg gat 
Trp Asp 

60 
tat cct 
Tyr Pro 
75 

gat tct 
Asp Ser 

ttc tgt 
Phe Cys 



gga gat 
Gly Asp 

gga acc 
Gly Thr 

30 
get gat 
Ala Asp 
45 

acg get 
Thr Ala 



etc cgt aac 
Leu Arg Asn 



cat 
His 

get 
Ala 

gag 
Glu 
170 
aaa 
Lys 



ccc gtc 
Pro Val 
140 
tac aaa 
Tyr Lys 
155 

gtt ttt 
Val Phe 

gtg aag 
Val Lys 



gac 
Asp 

ctt 
Leu 

tec 
Ser 

gac 
Asp 
125 
acc 
Thr 



tea 
Ser 

gac 
Asp 

agt 
Ser 
110 
cca 
Pro 



ggt gca 
Gly Ala 
15 

ttt CCC 

Phe Pro 

gtt gag 
Val Glu 

gga caa 
Gly Gin 

cat gtt 
His Val 

80 
aat gtt 
Asn Val 
95 

ctt cct 
Leu Pro 

aaa att 
Lys He 



aca gaa gaa 
Thr Glu Glu 



tac ctt 
Tyr Leu 

gaa tea 
Glu Ser 

ccc tct 
Pro Ser 
190 



gag tgt 
Glu Cys 
160 
gee act 
Ala Thr 
175 

agt gga 
Ser Gly 



ttg taa 
Leu 



48 

96 

144 

192 

240 

288 

336 

384 

432 

480 

528 

576 

609 



<210> 96 
<211> 202 
<212> PRT 
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<213> Schizosaccharomyces pombe 



PCT/EP2004/008136 



<400> 96 

Met Ala Thr .Giu Leu Arg Arg Lys Leu Val He Val Gly Asp Gly Ala 

1 5 io 15 

Cys Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Thr Phe Pro 

20 .25 30 

Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp Val Glu 

35 40 45 

Val Asp Gly Arg His Val Glu Leu Ala Leu' Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser His Val 
65 70 75 80 

He Leu He Cys Phe Ala Val Asp Ser Pro Asp Ser Leu Asp Asn Val 

85 90 95 

Gin Glu Lys Trp He Ser Glu Val Leu His Phe Cys Ser Ser Leu Pro 

100 105 no 

He Leu Leu Val Ala Cys Lys Ala Asp Leu Arg Asn Asp Pro Lys He 

115 120 125 

He Glu Glu Leu Ser Lys Thr Asn Gin His Pro Val Thr Thr Glu Glu 

130 135 140 

Gly Gin Ala Val Ala Gin Lys He Gly Ala Tyr Lys Tyr Leu Glu Cys 
145 • 150 155 160 

Ser Ala Lys Thr Asn Glu Gly Val Arg Glu Val Phe Glu Ser Ala Thr 

165 170 175 

Arg Ala Ala Met Leu Lys His Lys Pro Lys Val Lys Pro Ser Ser Gly 

180 185 " 190 

Thr Lys Lys Lys Lys Arg Cys He Leu Leu 
195 200 

<210> 97 
<211> 603 
<212> DNA 

<213> Schizosaccharomyces pombe 

<220> 

<221> CDS 

<222> (1) . . (603) 

<400> 97 

atg ttg caa tct caa ccg att aga "agg aaa eta gta gtt gta ggc gat 48 

Met Leu Gin Ser Gin Pro He Arg Arg Lys Leu Val Val Val Gly Asp 

1 5 10 15 

ggt gee tgc ggg aag acg tct ttg tta tec gtt ttc act ttg gga tat 96 

Gly Ala Cys Gly Lys Thr Ser Leu Leu Ser Val Phe Thr Leu Gly Tyr 

20 25 30 

ttt cct act gaa tat gtt ccg act gtg ttt gaa aac tat gtt tea gat 144 
Phe Pro Thr Glu Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ser Asp 

35 40 45 

tgt cga gtt gat gga aaa tea gtt caa ctg gca ttg tgg gat acc gca 192 
Cys Arg Val Asp Gly Lys Ser Val Gin Leu Ala Leu Trp Asp Thr Ala 

50 55 60 

ggt caa gag gaa tat gag aga tta aga cca atg tct tac get aaa gca 240 
Gly Gin Glu Glu Tyr Glu Arg Leu Arg Pro Met Ser Tyr Ala Lys Ala 
65 70 75 80 

cac att att ttg gtt ggg ttt gcg ata gac tct cct gat tea ctt gaa 288 
His He He Leu Val Gly Phe Ala He Asp Ser Pro Asp Ser Leu Glu 

85 90 95 

aat gtt tct acg aag tgg att gag gaa att aat aca ctt tgc cca aat 336 
Asn Val Ser Thr Lys Trp He Glu Glu He Asn Thr Leu Cys Pro Asn 

100 105 no . 

gtt ccg ttt att tta gtg ggc atg aaa gec gac tta agg tea gat ccc 384 
Val Pro Phe He Leu Val Gly Met Lys Ala Asp Leu Arg Ser Asp Pro 

115 120 125 

gtt get att gaa- gaa atg cga cgt cga aat caa aat ttt gta aaa tea 432 
Val Ala He Glu Glu Met Arg Arg Arg Asn Gin Asn Phe Val Lys Ser 

130 135 140 

caa cag get gaa tta gta get cag egg att ggt gcg aga aag tat atg 480 



WO 2005/014828 



Gin 


Gin Ala 


Glu 


Leu Val 


145 






150 


gaa 


tgt tct 


tea 


ttg act 


Glu 


Cys Ser 


Ser 


Leu Thr 








165 


get 


act agg 


gca 


gca eta 


Ala 


Thr Arg 


Ala 


Ala Leu 






180 




tct 


aca aaa 


tgc 


tgc ate 


Ser 


Thr Lys 


Cys 


Cys lie 



195 



82/291 



Ala 


Gin Arg 


He Gly Ala 






155 


ggt 


gac ggc 


gtg gac gat 


Gly 


Asp Gly 


Val Asp Asp 






170 


aca 


gtt egg 


gat teg gaa 


Thr 


Val Arg 


Asp Ser Glu 




185 


att 


tea taa 




lie 


Ser 






200 





PCT/EP2004/008136 



Arg 


Lys 


Tyr 


Met 










160 




gta 


ttt 


gaa 


get 


528 


Val 


Phe 


Glu 


Ala 








175 






aat 


gac 


aag 


agt 


576 


Asn 


Asp 


Lys 


Ser 





190 



603 



<210> 98 
<211> 200 
<212> PRT 

<213> Schizosaccharomyces pombe 



<400> 98 



Met 


Leu 


Gin 


Ser 


Gin 


Pro 


He 


Arg 


1 








5 








Gly 


Ala 


Cys 


Gly 
20 


Lys 


Thr 


Ser 


Leu 


Phe 


Pro 


Thr 
35 


Glu 


Tyr 


Val 


Pro 


Thr 
40 


Cys 


Arg 
50 


Val 


Asp 


Gly 


Lys 


Ser 
55 


Val 


Gly 


Gin 


Glu 


Glu 


Tyr 


Glu 


Arg 


Leu 


65 










70 






His 


He 


He 


Leu 


Val 
85 


Gly 


Phe 


Ala 


Asn 


Val 


Ser 


Thr 
100 


Lys 


Trp 


He 


Glu 


Val 


Pro 


Phe 
115 


He 


Leu 


Val 


Gly 


Met 
120 


Val 


Ala 
130 


He 


Glu 


Glu 


Met 


Arg 
135 


Arg 


Gin 


Gin 


Ala 


Glu 


Leu 


Val 


Ala 


Gin 


145 










150 






Glu 


Cys 


Ser 


Ser 


Leu 
165 


Thr 


Gly 


Asp 


Ala 


Thr 


Arg 


Ala 
180 


Ala 


Leu 


Thr 


Val 


Ser 


Thr 


Lys 
195 


Cys 


Cys 


He 


He 


Ser 
200 



Arg 


Lys Leu Val Val 


Val 


Gly 


Asp 




10 




15 




Leu 


Ser Val Phe Thr 


Leu 


Gly 


Tyr 


25 




30 






Val 


Phe Glu Asn Tyr 


Val 


5er 


Asp 




45 








Gin 


Leu Ala Leu Trp 


Asp 


Thr 


Ala 




60 








Arg 


Pro Met Ser Tyr 


Ala 


Lys 


Ala 




75 






80 


He 


Asp Ser Pro Asp 


Ser 


Leu 


Glu 




90 




95 




Glu 


He Asn Thr Leu 


Cys 


Pro 


Asn 


.105 




110 






Lys 


Ala Asp Leu Arg 


Ser 


Asp 


Pro 




125 








Arg 


Asn Gin Asn Phe 


Val 


Lys 


Ser 




140 








Arg 


He Gly Ala Arg 


Lys 


Tyr 


Met 




155 






160 


Gly 


Val Asp Asp Val 


Phe 


Glu 


Ala 




170 




175 




Arg 


Asp Ser Glu Asn 


Asp 


Lys 


Ser 


185 




190 







<210> 99 
<211> 726 
<212> DNA 

<213> Entamoeba histolytica 



<220> 

<221> CDS 

<222> (97) . . (687) 

<400> 99 

aaatgtaaag aacattaaaa ataaaaaaac agtgaaaaga aatggattaa aaatagaatt 60 



tcaaggaact gaaactcgta attcaaaaga aaacac atg caa get gtc aaa tgt 114 

Met Gin Ala Val Lys Cys 
1 5 

gtc att gtt ggg gat gga get gta gga aaa act tgt ctt tta att tct 162 
Val He Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser 

10 * 15 20 

tac act aca aat gca ttt cct aat gaa tat att cca aca gta ttt gat 210 
Tyr Thr Thr Asn Ala Phe Pro Asn Glu Tyr He Pro Thr Val Phe Asp 
25 30 35 
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aat tat tct get act gtt atg gtt gat tea aaa cca att aat ctt gga 258 
Asn Tyr Ser Ala Thr Val Met Val Asp Ser Lys Pro He Asn Leu Gly 

40 45 50 

ctt tgg gat act get gga caa gaa gat tat gat aga tta aga cca ttg 306 
Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu 
55 -60 65 70 

tea tat cca caa acc gat gta ttc tta att tgt ttc tea gtt gtt tct 354 
Ser Tyr Pro Gin Thr Asp Val Phe Leu He Cys Phe Ser Val Val Ser 

75 80 85 

cca cca tea ttt gat aat gtc tea tct aaa tgg caa cca gaa gtt tct 402 
Pro Pro Ser Phe Asp Asn Val Ser Ser Lys Trp Gin Pro Glu Val Ser 

90 95 100 

cat cat tgt cca aag act cct tgt tta tta gtt gga act aaa ctt gat 450 
His His Cys Pro Lys Thr Pro Cys Leu Leu Val Gly Thr Lys Leu Asp 

105 no 115 

atg aga gaa gat aaa gaa caa tta aag aga tta gaa gag aaa aaa att 498 
Met Arg Glu Asp Lys Glu Gin Leu Lys Arg Leu Glu Glu Lys Lys He 
' " 120 125 130 

act cca ate act act gaa caa ggt gag get aaa tgc aaa gat att gga 546 

Thr Pro He Thr Thr Glu Gin Gly Glu Ala Lys Cys Lys Asp He Gly 

135 140 145 150 

get gtt aag tat ate gaa tgt tct get tta act caa aag aat tta aga 594 

Ala Val Lys Tyr He Glu Cys Ser Ala Leu Thr Gin Lys Asn Leu Arg 

155 160 165 

ctt gtt ttt gat gaa get gtt aga gca gtt att tct cct gca ggt gga 642 
Leu Val Phe Asp Glu Ala Val Arg Ala Val He Ser Pro Ala Gly Gly 

170 175 180 

gca aaa aag gat aaa aag aat aat aga gga tgt tta tta ttc taaattcgat 694 
Ala Lys Lys Asp Lys Lys Asn Asn Arg Gly Cys Leu Leu Phe 

185 190 195 

tcattccttt tttaaacaac ttacaaacaa ta 726 



<210> 100 
<211> 196 
<212> PRT 

<213> Entamoeba histolytica 



<400> 100 



Met Gin Ala Val Lys 
1 5 


Cys 


Val 


He 


Thr Cys Leu Leu He 


Ser 


Tyr 


Thr 


20 






He Pro Thr Val Phe 


Asp 


Asn 


Tyr 


35 






40 


Lys Pro He Asn Leu 


Gly 


Leu 


Trp 


50 




55 


Asp Arg Leu Arg Pro 


Leu 


Ser 


Tyr 


65 


70 




Cys Phe Ser Val Val 


Ser 


Pro 


Pro 


85 








Trp Gin Pro Glu Val 


Ser 


His 


His 


100 








Val Gly Thr Lys Leu 


Asp 


Met Arg 


115 






120 


Leu Glu Glu Lys Lys 


He 


Thr 


Pro 


13 0 




13 5 




Lys Cys Lys Asp He 


Gly 


Ala 


Val 


145 


150 






Thr Gin Lys Asn Leu 


Arg 


Leu 


Val 


165 






He Ser Pro Ala Gly 


Gly 


Ala 


Lys 



180 

Cys Leu Leu Phe 
195 



Val Gly Asp Gly Ala Val Gly Lys 

10 15 
Thr Asn Ala Phe Pro Asn Glu Tyr 
25 30 
Ser Ala Thr Val Met Val Asp Ser 
45 

Asp Thr Ala Gly Gin Glu Asp Tyr 
60 

Pro Gin Thr Asp Val Phe Leu He 

75 80 
Ser Phe Asp Asn Val Ser Ser Lys 

90 95 
Cys Pro Lys Thr Pro Cys Leu Leu 
105 110 
Glu Asp Lys Glu Gin Leu Lys Arg 
125 

He Thr Thr Glu Gin Gly Glu Ala 
140 

Lys Tyr He Glu Cys Ser Ala Leu 
155 160 
Phe Asp Glu Ala Val Arg Ala Val 

170 175 
Lys Asp Lys Lys Asn Asn Arg Gly 
185 190 
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<210> 101 
<211> 744 
<212> DNA 

<213> Entamoeba histolytica 

<220> 

<221> CDS 

<222> (145) . . (729) 

<400> 101 

aaatataatc ttaaaataga aagaactgca aaaactaaaa aacataaaaa aagaaactct 60 



agaaaagcta ttaaagtaac gaaaaaaaca tgaaacaaag aaataataga caactcagaa 120 



ctaaaaaaag ttgaaagaaa atca atg agt gaa aaa ccc aca tea. ate. aaa- 171 

Met Ser Glu Lys Pro Thr Ser lie Lys 
1 5 



tta 


gtc 


gtt 


gtt 


ggg 


gat 


ggt get 


gtt 


gga 


aaa 


aca 


tgt 


tta 


tta 


att 


219 


Leu 


Val 


Val 


Val 


Gly 


Asp 


Gly Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 




10 










15 








20 










25 




tgc 


tat 


aca 


act 


aat 


gaa 


ttt cca 


aaa 


gat 


tat 


att 


cct 


aca 


gta 


ttc 


267 


Cys 


Tyr 


Thr 


Thr 


Asn 
30 


Glu 


Phe Pro 


Lys 


Asp 
35 


Tyr 


He 


Pro 


Thr 


Val 
40 


Phe 




gat 


aat 


tat 


gtt 


gtg 


tea 


tta act 


gca 


ggc 


aca 


aga 


caa 


att 


caa 


ttg 


315 


Asp 


Asn 


Tyr 


Val 
45 


Val 


Ser 


Leu Thr 


Ala 
50 


Gly 


Thr 


Arg 


Gin 


He 
55 


Gin 


Leu 




gcg 


ttg 


tgg 


gat 


act 


get 


gga caa 


gaa 


gag 


tat 


gat 


caa 


ttg 


aga 


cca 


363 


Ala 


Leu 


Trp 
60 


Asp 


Thr 


Ala 


Gly Gin 
65 


Glu 


Glu 


Tyr 


Asp 


Gin 
70 


Leu 


Arg 


Pro 




ttg 


agt 


tat 


tea 


tct 


get 


tea ata 


ttt 


tta 


ata 


tgt 


ttc 


tct 


gtc 


aca 


411 


Leu 


Ser 


Tyr 


Ser 


Ser 


Ala 


Ser He 


Phe 


Leu 


He 


Cys 


Phe 


Ser 


Val 


Thr 






75 . 








80 








85 












agt 


tct 


gta 


tea 


tat 


g at 


aat gta 


att 


aca 


aaa 


tgg 


cac 


cct 


gaa 


gtt 


4*59 


Ser 


Ser 


Val 


Ser 


Tyr 


Asp 


Asn Val 


He 


Thr 


Lys 


Trp 


His 


Pro 


Glu 


Val 




90 










95 








100 










105 




att 


cac 


ttt 


gca 


cca 


aaa 


gtt cct 


att 


ata 


tta 


gtt 


gga 


aca 


aaa 


ctt 


507 


lie 


His 


Phe 


Ala 


Pro 
110 


Lys 


Val Pro 


He 


He 
115 


Leu 


Val 


Gly 


Thr 


Lys 
120 


Leu 




gat 


act 


aga 


aat 


gat 


cct 


get att 


gtt 


aaa 


aga 


tta 


aca 


gaa 


cag 


ggt 


555 


Asp 


Thr 


Arg 


Asn Asp 


Pro 


Ala He 


Val 


Lys 


Arg 


Leu 


Thr 


Glu 


Gin 


Gly 










125 








13 0 










135 








atg 


act 


gtt 


att 


aat 


act 


gca aaa 


gga 


gaa 


gaa 


ctt 


aaa 


aat 


cgt 


ata 


603 


Met 


Thr 


Val 
140 


lie 


Asn 


Thr 


Ala Lys 
145 


Gly 


Glu 


Glu 


Leu 


Lys 
150 


Asn 


Arg 


He 




aaa 


get 


gtc 


aaa 


tat 


att 


gaa tgt 


tct 


gca 


aaa 


aca 


age 


gaa 


aat 


tta 


651 


Lys 


Ala 
155 


Val 


Lys 


Tyr 


He 


Glu Cys 
160 


Ser 


Ala 


Lys 


Thr 
165 


Ser 


Glu 


Asn 


Leu 




aag 


aca 


gtt 


ttc 


gat 


gaa 


gca gtt 


aaa 


act 


gtg 


tta 


atg 


aat 


aaa 


cca 


699 


Lys 


Thr 


Val 


Phe 


Asp 


Glu 


Ala Val 


Lys 


Thr 


Val 


Leu 


Met 


Asn 


Lys 


Pro 




170 










175 








180 










185 




cag 


caa 


agg 


tea 


aag 


tgt 


get ctt 


tta 


taattcactt ttaaatta 






744 


Gin 


Gin 


Arg 


Ser 


Lys 
190 


Cys 


Ala Leu 


Leu 



















<210> 102 
<211> 194 
<212> PRT 

<213> Entamoeba histolytica 
<400> 102 

Met Ser Glu Lys Pro Thr Ser He 
1 5 
Ala Val Gly Lys Thr Cys Leu Leu 
20 

Pro Lys Asp Tyr He Pro Thr Val 



Lys Leu Val Val Val Gly Asp Gly 

10 15 
He Cys Tyr Thr Thr Asn Glu Phe 
25 30 
Phe Asp Asn Tyr Val Val Ser Leu 
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35 40 45 

Thr Ala Gly Thr Arg Gin lie Gin Leu Ala Leu Trp Asp Thr Ala Gly 

50 55 60 

Gin Glu Glu Tyr Asp Gin Leu Arg Pro Leu Ser Tyr Ser Ser Ala Ser 
65 70 75 80 

lie Phe Leu lie Cys Phe Ser Val Thr Ser Ser Val Ser Tyr Asp Asn 

85 90 95 

Val lie Thr Lys Trp His Pro Glu Val He His Phe Ala Pro Lys Val 

100 105 no 

Pro He He Leu Val GXy Thr Lys Leu Asp Thr Arg Asn Asp Pro Ala 

115 . 120 125 

He Val Lys Arg Leu Thr Glu Gin Gly Met Thr Val He Asn Thr Ala 

130 135 140 

Lys Gly Glu Glu Leu Lys Asn Arg He Lys Ala Val Lys Tyr He Glu 
145 150 155 160 

Cys Ser Ala Lys Thr Ser Glu Asn Leu Lys Thr Val Phe Asp Glu Ala 
165 170 175 

"Val Lys Thr Val Leu Met Asn Lys Pro" Gin Gin Arg Ser Lys Cys Ala" 
180 185 190 

Leu Leu 

<210> 103 

<211> 588 

<212> DNA 

<213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1) . . (588) 

<400> 103 

atg caa gca ate aaa tgt gtc gtc gtt ggt gac gga gec gtc ggt aag 48 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 ' 15 

acg tgt etc ctg eta teg tac act aca aac get ttt ccc gga gaa tat 96 

Thr Cys Leu Leu Leu Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

att eta acg gta ttc gac ace tac tea aca aat gtg atg gtc gac gga 144 
He Leu Thr Val Phe Asp Thr Tyr Ser Thr Asn Val Met Val Asp Gly 

35 40 45 

agg cca ata aat etc age eta tgg gac aca get gga cag gac gat tac 192 
Arg Pro He Asn Leu Ser Leu Trp Asp Thr Ala Gly Gin Asp Asp Tyr 

50 55 60 

gat caa ttc cgc cac ctg tea ttt cca caa aca gac gta ttc etc gta 240 
Asp Gin Phe Arg His Leu Ser Phe Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

tgc ttt gca ctg aat aat cca gca agt ttt gag aat gtt cgt gca aaa 288 
Cys Phe Ala Leu Asn Asn Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

tgg tat cca gaa gta tea cat cat tgc cca aat aca ccg att att ttg 336 
Trp Tyr Pro Glu Val Ser His His Cys -Pro Asn Thr Pro He He Leu 

100 105 no 

gtt gga act aaa get gat tta cgc gag gat cga gat act att gaa egg 384 
Val Gly Thr Lys Ala Asp Leu Arg Glu Asp Arg Asp Thr lie Glu Arg 

115 120 125 

etc cgt gaa cgt egg etc caa cca gtg age cac ace cag ggt tac gtg 432 
Leu Arg Glu Arg Arg Leu Gin Pro Val Ser His Thr Gin Gly Tyr Val 

130 135 140 

atg gca aag gaa ate aag gcg gtc aag tac ctg gaa tgc teg gcg ctt 480 
Met Ala Lys Glu He Lys Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 " 155 160 

acc caa att gga ttg aaa caa; gtt ttc gat gag gca att cgt act ggg 528 
Thr Gin He Gly Leu Lys Gin Val Phe Asp Glu Ala He Arg Thr Gly 

165 170 175 

etc acc ccg cca caa aca cca caa acg aga gee aaa aag age aat tgc 576 
Leu Thr Pro Pro Gin Thr Pro Gin Thr Arg Ala Lys Lys Ser Asn Cys 

180 185 190 
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acg gtg ctt taa 588 
Thr Val Leu 
195 



<210> 104 
<211> 195 
<212> PRT 

<213> Caenor habdi t i s elegans 
<400> 104 

Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

15 10 15 

Thr Cys Leu Leu Leu Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 3 0 

He Leu Thr Val Phe Asp Thr Tyr Ser Thr Asn Val Met Val Asp Gly 

35 40 45 

ArgPro lie Asn Leu Ser Leu Trp Asp Thr Ala Gly Gin Asp Asp Tyr 

50 55 60 

Asp Gin Phe Arg His Leu Ser Phe Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

Cys Phe Ala Leu Asn Asn Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

Trp Tyr Pro Glu Val Ser His His Cys Pro Asn Thr Pro He He Leu 

100 105 HO 

Val Gly Thr Lys Ala Asp Leu Arg Glu Asp Arg Asp Thr He Glu Arg 

115 120 125 

Leu Arg Glu Arg Arg Leu Gin Pro Val Ser His Thr Gin Gly Tyr Val 

130 135 140 

Met Ala Lys Glu He Lys Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin He Gly Leu Lys Gin Val Phe Asp Glu Ala He Arg Thr Gly 

165 170 175 

Leu Thr Pro Pro Gin Thr Pro Gin Thr Arg Ala Lys Lys Ser Asn Cys 
180 185 190 

Thr Val Leu 
195 

<210> 105 
<211> 594 
<212> DNA 

<213> Zea mays (maize) 

<220> 

<221> CDS 

<222> (1) . . (594) 

<400> 105 

atg age gcg tec agg ttc ata aag tgc gtc acg gtc ggg gac ggc gec 48 
Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
15 10 15 

gtc ggc aag acc tgc atg etc ate tec tac acc tec aac ace ttc ccc 96 
Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 ' 25 30 

acc gac tat gtt ccg aca gtg ttt gat aac ttc agt gee aac gtt gtg 144 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

gtt gat ggt aat act gtc aac etc ggc etc tgg gac act gca ggt caa 192 
Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 * 55 60 

gag gat tac aac aga ctg aga cca ctg age tat cgt gga get gat gtt 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

ttt ctt ctg get ttc tea ctg ate agt aag gee age tat gag aat gtt 288 
Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

teg aag aag tgg ata cct gaa ctg aag cat tat gca cct ggt gtg cca 33 6 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 



240 
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105 iio 
aca aag ctt gat ctt cga gac gac aag cag ttc 384 
Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

120 125 
ggt get gtc cct ate act act get cag gga gag 432 
Gly Ala Val Pro lie Thr Thr Ala Gin Gly Glu 

135 140 
ata ggc get cca tac tac ate gaa tgc age teg 480 
lie Gly Ala Pro Tyr Tyr lie Glu Cys Ser Ser 
ISO • 155 160 

gtg aag ggc gtc ttt gat gcg gcg ata aag gtt 528 
Val Lys Gly Val Phe Asp Ala Ala lie Lys Val 

170 175 
aag gcg aag aag aag aaa aag gtg cag agg ggg 576 
Lys Ala Lys Lys Lys Lys Lys Val Gin Arg Gly 

185 190 
tga 594 









100 




att 


att 


etc 


gta 


ggg 


lie 


lie 


Leu Val Gly 






115 






ttt 


gtg 


gac 


cat 


cct 


Phe 


val 


Asp His Pro 




130 








gag 


eta 


aga 


aag 


caa 


Glu 


Leu 


Arg Lys 


Gin 


145 










aag 


acc 


caa 


eta 


aac 


Lys 


Thr 


Gin 


Leu 


Asn 










165 


gtg 


ctg 


cag 


ccg 


cct 


Val 


Leu 


Gin 


Pro 


Pro 








180 




gcg 


tgc 


tec 


att 


ttg 


Ala 


Cys 


Ser 


lie Leu 






195 







<210> 106 
<211> 197 
<212> PRT 

<213> Zea mays (maize) 
<400> 106 

Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Met Leu lie Ser Tyr Thr Srx Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 ,70 75 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 

100 105 no 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Gin He Gly Ala Pro Tyr Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Leu Asn Val Lys Gly Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Ala Lys Lys Lys Lys Lys Val Gin Arg Gly 

180 185 190 

Ala Cys Ser He Leu 
.195 

<210> 107 
<211> 1058 
<212> DNA 

<213> Zea mays (maize) 

<220> 

<221> CDS 

<222> (176) . . (769) 

<400> 107 

gaatteggea cgagagctct caagacggcc gacggccggc ttgcctacct gctcccatcc 60 



ttcccgaggg accgagaaag ataagaaagg cggtggtcaa cttgtgtcct gaggtgcccg 120 
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tagaagccca aggacaagaa acaaggagaa gagtagatct acatctactc caccg atg 178 

Met 



age 


gcg 


tct 


egg 


ttc 


ate 


aag tgc 


gtc 


acc 


gtg 


ggg 


gac 


ggt 


gee 


JL 

gxc 


226 


Ser 


Ala 


Ser 


Arg 


Phe 


He 


Lys Cys 


Val 


Thr 


val 


Gly 


Asp 


Gly 


Ala 


Val 










5 








10 










15 








gga 


aag 


acc 


tgc 


atg 


etc 


ate tec 


tac 


aca 


tec 


aac 


act 


ttc 


ccc 


act 


274 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


He Ser 


Tyr 


Thr' 


Ser 


Asn 


Thr 


Phe 


Pro 


Thr 




20 






25 










30 










gac 


tat 


gtt 


cca 


act 


gtg 


ttc gac 


aac 


ttc 


agt 


gee 


aat 


gtt 


gtg 


gtt 


322 


Asp 


Tyr 


Val 


Pro 


Thr 


val 


Phe Asp 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


val 


Val 




35 










40 








45 












gac 


ggg 


age 


act 


gtc 


aac 


ttg ggt 


ctg 


tgg 


gat 


aca 


gca 


gga 


caa 


gaa 


370 


Asp 


Gly 


Ser 


Thr 


Val 


Asn 


Leu Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 




50 










55 








60 




- 






65 . 




gat 


tac 


aat 


aga 


ctg 


cgt 


ccg ttg 


age 


tat 


cgt 


ggt 


get 


gat 


gtt 


ttt 


418 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 












70 








75 










80 






ctg 


etc 


gee 


ttt 


tct 


ctt 


ate age 


aaa 


gca 


age 


tat 


gag 


aat 


gtc 


tct 


466 


Leu 


Leu 


Ala 


Phe 


Ser 


Leu 


He Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


Val 


Ser 










85 








90 










95 








aag 


aag 


tgg 


gtt 


cct 


gaa 


tta agg 


cac 


tat 


get 


cct 


ggc 


gtg 


ccc 


ata 


514 


Lys 


Lys 


Trp 


Val 


Pro 


Glu 


Leu Arg 


His 


Tyr 


Ala 


Pro 


Gly 


Val 


Pro 


He 








100 








105 










110 










ate 


ctt 


gtt 


ggg 


aca 


aaa 


ctt gat 


ctg 


cgt 


gat 


gat 


aag 


cag 


ttt 


ttt 


562 


He 


Leu 


Val 


Gly 


Thr 


Lys 


Leu Asp 


Leu 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 


Phe 






115 










120 








125 












gtt 


gat 


cac 


cct 


ggt 


get 


gtt *cca 


att 


tec 


act* 


gee 


cag 


ggc 


gaa 


gag 


510 


val 


Asp 


His 


Pro 


Gly 


Ala 


Val Pro 


He 


Ser 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 




13 0 






135 








140 










145 




ctg 


agg 


aag 


eta 


att 


ggt 


get gee 


gee 


tac 


ate 


gaa 


tgc 


agt 


tea 


aaa 


658 


Leu 


Arg 


Lys 


Leu 


He 


Gly 


Ala Ala 


Ala 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 


Lys 












150 








155 










160 






ate 


cag 


cag 


aac 


ata 


aaa 


gca gtg 


ttt 


gac 


gca 


gca 


att 


aag 


gtg 


gtt 


706 


He 


Gin 


Gin 


Asn 


He 


Lys 


Ala val 


Phe 


Asp 


Ala 


Ala 


He 


Lys 


Val 


Val 










165 








170 










175 








etc 


cag 


cca 


cca 


aag 


caa 


aag aag 


agg 


aag 


aag 


aag 


gtg 


cag 


aag 


gga 


754 


Leu 


Gin 


Pro 


Pro 


Lys 


Gin 


Lys Lys 


Arg 


Lys 


Lys 


Lys 


Val 


Gin 


Lys 


Gly 





180 185 190 



tgc acc att ttg taactacaaa eggtagaggg caacagtctg gctgcggcgc 806 
Cys Thr He Leu 
195 

tgctgccaat gataaccatc gcctccttgc tgtataatat ategectgat catgccacca 866 
geatgeacaa gggagatggt ggttttagc^a tccttgtcct actgtgttgt gtagaccacc 926 



gggtgtagtt gactgtatct ggttgtttgt atgtatggac aagacaaaac tagcactgea 986 



gatggtatgg taaggcgtaa gcaaatacaa tatgacattg gtccagttcc aggaaaaaaa 1046 



aaaaaaaaaa aa 1058 



<210> 108 
<211> 197 
<212> PRT 

<213> Zea mays (maize) 



<400> 108 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
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1 








5 






10 




15 




Val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu He Ser 


Tyr 


Thr Ser 


Asn Thr Phe 


Pro 








20 






25 




30 




Thr 


Asp 


Tyr 
35 


Val 


Pro 


Thr 


Val Phe Asp 
40 


Asn 


Phe Ser 


Ala Asn Val 
45 


Val 


Val 


Asp 


Gly 


Ser 


Thr 


Val 


Asn Leu Gly 


Leu 


Trp Asp 


Thr Ala Gly 


Gin 




50 










55 




60 




Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg Pro Leu 


Ser 


Tyr Arg 


Gly Ala Asp 


Val 


65 










70 






75 


80 


Phe 


Leu 


Leu 


Ala 


Phe 
85 


Ser 


Leu He Ser 


Lys' 
90 


Ala Ser 


Tyr Glu Asn 
95 


Val 


Ser 


Lys 


Lys 


Trp 
100 


Val 


Pro 


Glu Leu Arg 
105 


His 


Tyr Ala 


Pro Gly Val 
110 


Pro 


He 


He 


Leu 
115 


Val 


Gly 


Thr 


Lys Leu Asp 
120 


Leu 


Arg Asp 


Asp Lys Gin 
125 


Phe 


Phe 


Val 


Asp 


His 


Pro 


Gly 


Ala Val Pro 


He 


Ser Thr 


Ala Gin Gly 


Glu 




13 0 










135 




140 




Glu 


Leu 


Arg 


Lys 


Leu 


lie 


Gly Ala Ala 


Ala 


Tyr He 


Glu Cys Ser 


Ser 


145 










150 






155 


-LOU 


Lys 


He 


Gin 


Gin 


Asn 
165 


He 


Lys Ala Val 


Phe 
170 


Asp Ala 


Ala He Lys 
175 


Val 


Val 


Leu 


Gin 


Pro 


Pro 


Lys 


Gin Lys Lys 


Arg 


Lys Lys 


Lys Val Gin 


Lys 








180 






185 






190 


Gly 


Cys 


Thr 
195 


He 


Leu 















<210> 109 
<211> 1045 
<212> DNA 

<213> Zea mays (maize) 

<220> 

<221> CDS 

<222> (45).. (707) 

<400> 109 

gaattcggca cgagctggct cgtgcagcgg cggcagtgag agcg atg age gcg gcg 56 

Met Ser Ala Ala 
1 



gca 


gcg 


gcg 


gcg 


gcg 


age 


teg 


gtc 


ace 


aag 


ttc 


ate 


aag 


tgc 


gtc 


acg 


104 


Ala 


Ala 


Ala 


Ala 


Ala 


Ser 


Ser 


Val 


Thr 


Lys 


Phe 


He 


Lys 


Cys 


Val 


Thr 




5 










10 










15 








20 




gtc 


ggc 


gat 


ggg 


gee 


gtc 


ggg 


aag 


acc 


tgc 


atg 


etc 


ate 


tgc 


tac 


acc 


152 


val 


Gly 


Asp 


Gly 


Ala 


val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


He 


Cys 


Tyr 


Thr 












25 










30 










35 






tgc 


aac 


aag 


ttc 


ccc 


acg 


gat 


tac 


ate 


ccc 


acc 


gta 


ttt 


gac 


aac 


ttc 


200 


Cys 


Asn 


Lys 


Phe 


Pro 


Thr 


Asp 


Tyr 


He 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 










40 










45 










50 








age 


gee 


aat 


gtc 


tec 


gtg 


ggt 


ggg 


age 


ate 


gtc 


aac 


ttg 


ggc 


etc 


tgg 


248 


Ser 


Ala 


Asn 


Val 


Ser 


Val 


Gly 


Gly 


Ser 


He 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 








55 










60 










65 








gac 


acg 


gca 


ggc 


cag 


gag 


gat 


tac 


age 


agg 


ttg 


agg 


cct 


etc 


age 


tac 


296 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Ser 


Arg 


Leu Arg 


Pro 


Leu 


Ser 


Tyr 






70 










75 










80 










agg 


ggt 


get 


gat 


gtg 


ttc 


ate 


etc 


tec 


ttc 


tec 


ctg 


gtc 


age 


agg 


gcg 


344 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


He 


Leu 


Ser 


Phe 


Ser 


Leu 


Val 


Ser 


Arg 


Ala 




85 










90 










95 








100 




age 


tat 


gag 


aac 


gtc 


ctg 


aag 


aag 


tgg 


atg 


cca 


gag 


ctt 


cgc 


cga 


ttt 


392 


Ser 


Tyr 


Glu 


Asn 


Val 


Leu 


Lys 


Lys 


Trp 


Met 


Pro 


Glu 


Leu 


Arg 


Arg 


Phe 












105 










110 










115 






tea 


cct 


act 


gtt 


cct 


gta 


gtt 


ctt 


gtt 


gga 


acc 


aaa 


eta 


gat 


etc 


cgt 


440 


Ser 


Pro 


Thr 


Val 


Pro 


Val 


Val 


Leu 


Val 


Gly 


Thr Lys 


Leu Asp 


Leu 


Arg 










120 










125 










13 0 






gaa 


gac 


aga 


tct 


tac 


ctt 


get 


gac 


cat 


tct 


get 


get 


tec 


ate 


ate 


tct 


488 


Glu 


Asp 


Arg 


Ser 


Tyr 


Leu 


Ala 


Asp 


His 


Ser 


Ala 


Ala 


Ser 


He 


He 


Ser 








135 










140 










145 










act 


gaa 


cag 


gga 


gaa 


gag 


etc 


agg 


aag 


cag 


ata 


ggt 


get 


gtg 


gcg 


tac 


536 


Thr 


Glu 


Gin 


Gly 


Glu 


Glu 


Leu 


Arg 


Lys 


Gin 


He Gly 


Ala 


Val 


Ala 


Tyr. 
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150 155 160 

ata gaa tgc age tea aag aca cag agg aac gta aag get gtg ttc gac 584 
He Glu Cys Ser Ser Lys Thr Gin Arg Asn Val Lys Ala Val Phe Asp 
165 170 175 180 

act gca att aaa gta gtg ctg caa cca ccg agg aga aga gaa gtt acc 632 
Thr Ala He Lys Val Val Leu Gin Pro Pro Arg Arg Arg Glu Val Thr 

185 190 195 

agg aag aaa atg aag aca agt teg aat cag tct ctg aga aga tac etc 680 
Arg Lys Lys Met Lys Thr ser Ser Asn Gin Ser Leu Arg Arg Tyr Leu 

200 205 210 

tgt gga age gga tgt ttc aca teg taaagcacag actcttctgc gactgttgta 734 
Cys Gly Ser Gly Cys Phe Thr Ser 
215 220 
ctggacttgc tagatggttg cagctctatg aatgagtagt cccctccgca gccactggga 794 

acttctggtt ctctgctacc ttccgataga gtgctctttt gcgttcacca gctgagaaaa 854 

atgaagegag gttctagttt ataaattccc tacgaggtgt accttcttta gtatgaatgg 914 

tgggctattt agcagttcag caaagtgtga agtgaccctt etatgeatgt tttgtttcca 974 

aaaactgatg ttgctaaatg gctaatgaat ggttatggtc geaceggaag aaaaaaaaaa 1034 

aaaaaaaaaa a 1045 



<210> 110 
<211> 220 
<212> PRT 

<213> Zea mays (maize) 



<400> 110 














Met 


Ser 


Ala 


Ala 


Ala 


Ala 


Ala Ala Ala Ser 


Ser Val 


Thr Lys Phe He 


1 








5 




10 




15 


Lys 


Cys 


Val 


Thr 


Val 


Gly 


Asp Gly Ala Val 


Gly Lys 


Thr Cys Met Leu 






20 






25 




30 


He 


Cys 


Tyr 


Thr 


Cys 


Asn 


Lys Phe Pro Thr 


Asp Tyr 


He Pro Thr Val 




35 








40 




45 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn Val Ser Val 


Gly Gly 


Ser He Val Asn 




50 










55 


60 




Leu Gly 


Leu 


Trp 


Asp 


Thr 


Ala Gly Gin Glu 


Asp Tyr 


Ser Arg Leu Arg 


65 










70 




75 


80 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala Asp Val Phe 


He Leu 


Ser Phe Ser Leu 








85 




90 




95 


Val 


•Ser 


Arg 


Ala 


Ser 


Tyr 


Glu Asn Val Leu 


Lys Lys 


Trp Met Pro Glu 






100 






105 




110 


Leu 


Arg 


Arg 


Phe 


Ser 


Pro 


Thr Val Pro Val 


Val Leu Val Gly Thr Lys 




115 








120 




125 


Leu 


Asp 


Leu 


Arg 


Glu 


Asp 


Arg Ser Tyr Leu 


Ala Asp 


His Ser Ala Ala 




130 










135 


140 




Ser 


He 


He 


Ser 


Thr 


Glu 


Gin Gly Glu Glu 


Leu Arg Lys Gin He Gly 


145 










150 




155 


160 


Ala 


Val 


Ala 


Tyr 


He 


Glu 


Cys Ser Ser Lys 


Thr Gin Arg Asn Val Lys 








165 




170 




175 


Ala 


Val 


Phe 


Asp 


Thr 


Ala 


He Lys Val Val 


Leu Gin 


Pro Pro Arg Arg 








180 






185 




190 


Arg 


Glu 


Val 


Thr 


Arg 


Lys 


Lys Met Lys Thr 


Ser Ser 


Asn Gin Ser Leu 




195 








200 




205 


Arg 


Arg 


Tyr 


Leu 


Cys 


Gly 


Ser Gly Cys Phe 


Thr Ser 




210 










215 


220 
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<210> 111 
<211> 876 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 

<221> CDS 

<222> (1) . . (876) 



PCT/EP2004/008136 



<400> 111 

atg aat aca eta tta ttt aag cga aaa ggt ggc aat tgt ggg aac gaa 48 

Met Asn Thr Leu Leu Phe Lys Arg Lys Gly Gly Asn Cys Gly Asn Glu 

1 5 10 ^ 15 

agt aac ata gtt teg cag gga teg ccc tea agt age aat ctt cct gaa 96 

Ser Asn lie Val Ser Gin Gly Ser Pro Ser Ser Ser Asn Leu Pro Glu 

20 25 30 

tea cct ggc act tta gat gaa aag aat ctt ccc aga ttg cct act cca 144 
Ser Pro Gly Thr Leu Asp Glu Lys Asn Leu Pro Arg Leu Pro Thr Pro 

35 40 45 

ttc get aga age ctt tct acc att cct agt tat gag cag atg aaa cgt 192 
Phe Ala Arg Ser Leu Ser Thr lie Pro Ser Tyr Glu Gin Met Lys Arg 

50 55 60 

aca aac aaa ctg cca gat tat cac eta aag att gtt gtt gtg gga gat 240 
Thr Asn Lys Leu Pro Asp Tyr His Leu Lys He Val Val Val Gly Asp 
65 70 75 80 

ggc get gta ggg aag acg tgc ctg ctg ata tct tat gtc caa gga aca 288 
Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Val Gin Gly Thr 

85 90 95 

ttt ccg act gat tat att cct act att ttc gaa aat tat gtc aca aac 336 
Phe Pro Thr Asp Tyr lie Pro Thr He Phe Glu Asn Tyr Val Thr Asn 

100 105 no 

ata gaa gga ccc aac ggt caa att ata gaa ttg gca tta tgg gac act 384 
He Glu Gly Pro Asn Gly Gin He lie Glu Leu Ala Leu Trp Asp Thr 

115 120 125 

gee ggc caa gaa gag tat agt aga ctt aga ccg ctt tea tat acg aat 432 
Ala Gly Gin Glu Glu Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Thr Asn 

130 135 140 

gca gat gtg ctg atg gtg tgc tat tct gtt ggt agt aag aca teg ctt 480 
Ala Asp Val Leu Met Val Cys Tyr Ser Val Gly Ser Lys Thr Ser Leu 
145 150 155 160 

aaa aat gtg gaa gat etc tgg ttc cca gag gtt aag cat ttt tgt cct 528 
Lys Asn Val Glu Asp Leu Trp Phe Pro Glu Val Lys His Phe Cys Pro 

165 170 175 

tec act cca ate atg eta gtc ggc ctt aaa tea gat eta tat gaa get 576 
Ser Thr Pro He Met Leu Val Gly Leu Lys Ser Asp Leu Tyr Glu Ala 

180 185 " 190 

gat aac ctt tea gat ctg gtg gaa cca agt tea gca gaa tec ttg gee 624 
Asp Asn Leu Ser Asp Leu Val Glu Pro Ser Ser Ala Glu Ser Leu Ala 

195 200 205 

aag cgt ctg ggg gca ttt gca cat att caa tgc tea gca cga ttg aaa 672 
Lys Arg Leu Gly Ala Phe Ala His He Gin Cys Ser Ala Arg Leu Lys 

210 215 220 

gaa aat ate gat gaa gta ttt gaa act gee ata cac acg tta ctt tec 720 
Glu Asn He Asp Glu Val Phe Glu Thr Ala He His Thr Leu Leu Ser 
225 230 235 240 

gat tea tta tat get ccc aga gag cct aca cat aca ate aaa aat ccc 768 
Asp Ser Leu Tyr Ala Pro Arg Glu Pro Thr His Thr He Lys Asn Pro 

245 250 255 

ttt aaa aga aat acc acc aga tea gat ate gat tct tct act gga gat 816 
Phe Lys Arg Asn Thr Thr Arg Ser Asp He Asp Ser Ser Thr Gly Asp 

260 265 *~ 270 

acc age gtc tct att tec gga acg aaa aga tta aga aaa aac aag tgt 864 
Thr Ser Val Ser He Ser Gly Thr Lys Arg Leu Arg Lys Asn Lys . Cys 

275 280 " 285 

att ata atg taa 87 g 
He He Met 
290 
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<210> 112 
<211> 291 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 112 

Met Asn Thr Leu Leu Phe Lys Arg Lys Gly Gly Asn Cys Gly Asn Glu 

15 io 15 

Ser Asn lie Val Ser Gin Gly Ser Pro Ser Ser Ser Asn Leu Pro Glu 

20 25 30 

Ser Pro Gly Thr Leu Asp Glu Lys Asn Leu Pro Arg Leu Pro Thr Pro 

35 40 45 

Phe Ala Arg Ser Leu Ser Thr lie Pro Ser Tyr Glu Gin Met Lys Arg 

50 ~ 55 60 

Thr Asn Lys Leu Pro Asp Tyr His Leu Lys lie Val Val Val Gly Asp 
65 " 70 75 80 

Gly Ala Val Gly Lys Thr Cys Leu Leu lie Ser Tyr Val Glh Gly Thr 

85 90 95 

Phe Pro Thr Asp Tyr He Pro Thr He Phe Glu Asn Tyr Val Thr Asn 

100 105 HO 

He Glu Gly Pro Asn Gly Gin He He Glu Leu Ala Leu Trp Asp Thr 

115 120 125 

Ala Gly Gin Glu Glu Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Thr Asn 

130 135 140 

Ala Asp Val Leu Met Val Cys Tyr Ser Val Gly Ser Lys Thr Ser Leu 
145 150 155 160 

Lys Asn Val Glu Asp Leu Trp Phe Pro Glu Val Lys His Phe Cys Pro 

165 170 175 

Ser Thr Pro He Met Leu'-'Val Gly Leu Lys Ser Asp Leu Tyr Glu Ala 

180 185 190 

Asp Asn Leu Ser Asp Leu Val Glu Pro Ser Ser Ala Glu Ser Leu Ala 

195 200 205 

Lys Arg Leu Gly Ala Phe Ala His He Gin Cys Ser Ala Arg Leu Lys 

210 * 215 220 

Glu Asn He Asp Glu Val Phe Glu Thr Ala He His Thr Leu Leu Ser 
225 230 235 240 

Asp Ser Leu Tyr Ala Pro Arg Glu Pro Thr His Thr He Lys Asn Pro 

245 250 255 

Phe Lys Arg Asn Thr Thr Arg Ser Asp He Asp Ser Ser Thr Gly Asp 

260 265 270 

Thr Ser Val Ser He Ser Gly Thr Lys Arg Leu Arg Lys Asn Lys Cys 

275 280 285 

He He Met 
290 



<210> 113 
<211> 696 
<212> DNA 

<213> Saccharomyces cerevisiae 



<220> 

<221> CDS 

<222> (1) . . (696) 

<400> 113 
atg tea ttt eta 
Met Ser Phe Leu 
1 

aga aag ate gtt 
Arg Lys He Val 
20 

ctg aat gtt ttc 
Leu Asn Val Phe 
35 

gtt ttt gaa aac 
Val Phe Glu Asn 
50 



tgt ggg tea get 
Cys Gly Ser Ala 
5 

att ttg ggc gac 
He Leu Gly Asp 

acc aga ggt tat 
Thr Arg Gly Tyr 
40 

tat ate cat gat 
Tyr He His Asp 
55 



tea acg tea aat 
Ser Thr Ser Asn 
10 

ggt gee tgt ggt 
Gly Ala Cys Gly 
25 

ttt ccc gaa gtt 
Phe Pro Glu Val 

att ttc gtt gac 
He Phe Val Asp 
60 



aaa ccg ate gaa 
Lys Pro He Glu 
15 

aaa act teg ttg 
Lys Thr Ser Leu 
30 

tat gag cct act 
Tyr Glu Pro Thr 
45 

agt aaa cat ate 
Ser Lys His He 
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acg 


eta 


teg 


ttg 


tgg 


gat 


act 


gcg 


srgc 


caa 


gag 


gaa ttt 


gac 


agg 


tta 


240 


Thr 


Leu 


Ser 


Leu Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Glu Phe 


Asp 


Arg 


Leu 




65 










70 










75 








80 




cga 


tec 


ttg 


tct 


tat 


tea 


gat 


acg 


caa 


tgt 


ata 


atg tta 


tgt 


ttc 


agt 


288 


Arg 


Ser 


Leu 


Ser 


Tyr 


Ser Asp 


Thr 


Gin 


Cys 


He 


Met Leu 


Cys 


Phe 


Ser 












85 










90 








95 






att 


gat 


tea 


cgc 


gat 


tct 


tta 


gaa 


aat 


gtc 


caa 


aat aaa 


tgg 


gtg 


ggt 


336 


lie 


Asp 


Ser Arg Asp 


Ser 


Leu 


Glu 


Asn 


Val 


Gin 


Asn Lys 


Trp 


Val 


Gly 










100 










105 








110 








gaa 


ate 


act 


gat 


cat 


tgt 


gaa 


ggc 


gtc 


aag 


tta 


gtc tta 


gtt 


gca 


eta 


384 


Glu 


He 


Thr 


Asp His 


Cys 


Glu 


Gly 


val 


Lys 


Leu 


Val Leu 


Val 


Ala 


Leu 








115 










120 






125 










aag 


tgt 


gac 


tta 


aga 


aac 


aat 


gaa 


aat 


gaa 


tct 


aac gca 


ate 


aca 


ccg 


432 


Lys 


Cys 


Asp 


Leu Arg 


Asn 


Asn 


Glu 


Asn 


Glu 


Ser 


Asn Ala 


He 


Thr 


Pro 




130 










135 










140 










aac 


aat 


ate 


caa 


cag 


gat 


aac 


age 


gtt 


tct 


aac 


gac aac 


gga 


aat 


aac 


480 


Asn 


Asn 


lie 


Gin 


Gin 


Asp Asn 


Ser 


Val 


Ser 


Asn 


Asp Asn 


Gly 


Asn 


Asn 




145 










150 




* 






155 








160 




ata 


aat 


age 


ace 


tea 


aac 


ggc 


aaa 


aac 


ctg 


ata 


agt tat 


gaa 


gaa 


ggt 


528 


lie 


Asn 


Ser 


Thr 


Ser 


Asn 


Gly 


Lys 


Asn 


lieu 


He 


Ser Tyr 


Glu 


Glu 


Gly 












165 










170 








175 






eta 


get 


atg 


get 


aaa 


aag 


ate 


ggt 


gcg 


eta 


cgt 


tat ttg 


gaa 


tgt 


age 


576 


Leu 


Ala 


Met 


Ala 


Lys 


Lys 


He 


Gly 


Ala 


Leu 


Arg 


Tyr Leu 


Glu 


Cys. 


Ser 










180 










185 








190 








get 


aag 


ctg 


aat 


aaa 


ggt 


gtc 


aac 


gaa 


get 


ttc 


aca gaa 


gee 


gca 


aga 


624 


Ala 


Lys 


Leu Asn Lys 


Gly Val 


Asn 


Glu 


Ala 


Phe 


Thr Glu 


Ala 


Ala 


Arg 








195 










200 








205 










gtt 


get 


tta 


ace 


gcg 


ggc 


cca 


gta 


gca 


acc 


gaa 


gtg aaa 


agt 


gac 


agt 


672 


Val 


Ala 


Leu 


Thr 


Ala 


Gly Pro 


Val 


Ala 


Thr 


Glu 


Val Lys 


Ser 


Asp 


Ser 






210 










215 










220 










gga 


tec 


age 


tgt 


acc 


att 


atg 


taa 
















696 


Gly 


Ser 


Ser 


Cys 


Thr 


lie' 


Met 




















225 










230 






















<210> 114 




























<211> 231 




























<212> PRT 




























<213> Saccharomyces 


cerevisiae 


















<400> 114 




























Met 


Ser 


Phe 


Leu 


Cys 


Gly 


Ser 


Ala 


Ser 


Thr 


Ser 


Asn Lys 


Pro 


He 


Glu 




1 








5 








10 








15 






Arg 


Lys 


He 


Val 


He 


Leu 


Gly 


Asp 


Gly 


Ala 


Cys 


Gly Lys 


Thr 


Ser 


Leu 










20 










25 








30 








Leu 


Asn 


Val 


Phe 


Thr 


Arg 


Gly 


Tyr 


Phe 


Pro 


Glu 


Val Tyr 


Glu 


Pro 


Thr 








35 










40 








45 










Val 


Phe 


Glu 


Asn 


Tyr 


He 


His 


Asp 


He 


Phe 


Val 


Asp Ser 


Lys 


His 


He 






50 










55 










60 










Thr 


Leu 


Ser 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Glu Phe 


Asp 


Arg 


Leu 




65 










70 










75 








80 




Arg 


Ser 


Leu 


Ser 


Tyr 


Ser 


Asp 


Thr 


Gin 


Cys 


He 


Met Leu 


Cys 


Phe 


Ser 












85 










90 








95 






He 


Asp 


Ser 


Arg 


Asp 


Ser 


Leu 


Glu 


Asn 


Val 


Gin 


Asn Lys 


Trp 


Val 


Gly 










100 










105 








110 








Glu 


He 


Thr 


Asp 


His 


Cys 


Glu 


Gly 


Val 


Lys 


Leu 


Val Leu 


Val 


Ala 


Leu 








115 










120 








125 










Lys 


Cys 


Asp 


Leu 


Arg 


Asn 


Asn 


Glu 


Asn 


Glu 


Ser 


Asn Ala 


He 


Thr 


Pro 






130 










135 










140 










Asn 


Asn 


He 


Gin 


Gin 


Asp 


Asn 


Ser 


Val 


Ser 


Asn 


Asp Asn 


Gly 


Asn 


Asn 




145 










150 










155 








160 




He 


Asn 


Ser 


Thr 


Ser 


Asn 


Gly 


Lys 


Asn 


Leu 


He 


Ser Tyr 


Glu 


Glu 


Gly 












165 










170 








175 




Leu 


Ala 


Met 


Ala 


Lys 


Lys 


He 


Gly 


Ala 


Leu 


Arg 


Tyr Leu 


Glu 


Cys 


Ser 










180 










185 








190 








Ala 


Lys 


Leu 


Asn 


Lys 


Gly 


Val 


Asn 


Glu 


Ala 


Phe 


Thr Glu 


Ala 


Ala 


Arg 








195 










200 








205 










Val 


Ala 


Leu 


Thr 


Ala 


Gly 


Pro 


Val 


Ala 


Thr 


Glu 


Val Lys 


Ser 


Asp 


Ser 





96 
144 
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210 215 220 

Gly Ser Ser Cys Thr lie Met 
225 230 

<210> 115 

<211> 579 

<212> DNA 

<213> Drosophila melanogaster 

<220> 
<221> CDS 
<222> (1) (579) 

<400> 115 . q 

atg acg acg att cgc aag aaa ttg gta att gtc ggc gac ggt gcc tgc 48 
Met Thr Thr lie Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 
15 10 .15 . 

ggt aaa act tgc ctt ctg att gtc ttc "age aaa gat cag ttc ccc gag 
Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtc tat gtg ccc acc gta ttc gag aat tat gtg gcc gac ate gag gtg 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

gat ggc aaa cag gtg gag ctg gcc ttg tgg gat acg gcc ggg cag gag 192 
Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

P 50 55 60 

gac tac gac aga eta cga ccg ctg age tat ccc gac act gac gtc ata 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 . 75 80 

ctg atg tgt ttc tea* gtg gat tea ccc gat teg eta gaa aat att cct 
Leu Met Cys Phe Ser Val Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gaa aaa tgg acc cca gag gtc aaa cac ttt tgt cca aat gtt cca ate • 336 
Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 HO 

att ttg gta gga aat aag aaa gat ttg cga aat gat ccc aac aca att 384 
He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Pro Asn Thr He 

115 J 12 0 125 

egg gat eta gca aaa atg aag cag gag ccg gtg aag ccg cag gag ggt 432 
Arg Asp Leu Ala Lys Met Lys Gin Glu Pro Val Lys Pro Gin Glu Gly 

130 135 140 

cgc gcc atg gcc gag aag att aat gcc ttt gcc tat ttg gag tgt teg 
Arq Ala Met Ala Glu Lys He Asn Ala Phe Ala Tyr Leu Glu Cys Ser 
145 - 150 155 160 

get aag tec aag gag' ggt gtg cga gat gtt ttc gag acg gca act agg 
Ala Lys Ser Lys Glu Gly Val Arg Asp Val Phe Glu Thr Ala Thr Arg 

165 170 175 

gcc gcg ctg caa gtc aaa aag agg aag aag acc aga tgc ctt ttg etc 576 
Ala Ala Leu Gin Val Lys Lys Arg Lys Lys Thr Arg Cys Leu Leu Leu 
180 185 190 



240 
288 



480 
528 



taa 



579 



<210> 116 
<211> 192 
<212> PRT 

<213> Drosophila melanogaster 

Met^Thr^hr He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

15 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala" Gly Gin Glu 



WO 2005/014828 



50 



Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


65 










70 


Leu 


Met 


Cys 


Phe 


Ser 


Val 










85 




Glu 


Lys 


Trp 


Thr 


Pro 


Glu 








100 






He 


Leu 


Val 


Gly 


As ii 


Lys 






115 








Arg 


Asp 


Leu 


Ala 


Lys 


Met 




130 










Arg 


Ala 


Met 


Ala 


Glu 


Lys 


145 










150 


Ala 


Lys 


Ser 


Lys 


Glu 


Gly 










165 




Ala 


Ala 


Leu 


Gin 


Val 


Lys 



180 



95/291 



55 








60 


Pro 


Leu 


Ser 


Tyr Pro 


Asp 








75 




Asp 


Ser 


Pro 


Asp Ser 


Leu 








90 




Val 


Lys 


His 


Phe Cys 


Pro 






105 






Lys 


Asp 


Leu 


Arg Asn 


Asp 




120 








Lys 


Gin 


Glu 


Pro' Val 


Lys 


135 








140 


He 


Asn 


Ala 


Phe Ala 


Tyr 








155 


Val 


Arg 


Asp 


Val Phe 


Glu 








170 




Lys 


Arg 


Lys 


Lys Thr 


Arg 



185 



PCT/EP2004/008136 



Thr 


Asp 


Val 


He 






80 


Glu 


Asn 


He 


Pro 






95 


He 


Asn 


Val 


Pro 




110 






Pro 


Asn 


Thr 


lie 


125 








Pro 


Gin 


Glu 


Gly 


Leu 


Glu 


Cys 


Ser 








160 


Thr 


Ala 


Thr 


Arg 






175 




Cys 


Leu 


Leu 


Leu 



190 



<210> 117 

<211> 579 

<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (579) 

<400> 117 



atcr 
«* w^ 


cacr 

w«.^-j 


crccr 


ate 


aaa 


tore 


crtc 

3 ww 


crtc 


gtg 


ggc 


gac 


gga 


gee 


gtg 


gga 


aag 


48 


Met 


Gin 


Ala 


He 


Lys 


Cys 


val 


Val 


val 


Gly Asp 


Gly 


Ala 


Val 


Gly 


Lys 




1 








5 










10 










15 






acc 


tgc 


ctg 


ctg 


ate 


age 


tac 


acg 


acc 


aat 


gee 


ttt 


ccc 


ggc 


gag 


tac 


96 


Thr 


Cys 


Leu 


Leu 
20 


He 


Ser 


Tyr 


Thr 


Thr 
25 


Asn 


Ala 


Phe 


Pro 


Gly 
30 


Glu 


Tyr 




ata 


ccc 


acc 


gtg 


ttc 


gac 


aac 


tac 


teg 


gee 


aac 


gtg 


atg 


gtg 


gac 


gee 


144 


He 


Pro 


Thr 
35 


Val 


Phe 


Asp 


Asn 


Tyr 
40 


Ser 


Ala 


Asn 


Val 


Met 
45 


val 


Asp 


Ala 




aag 


ccc 


ate 


aac 


ctg 


ggc 


ctg 


tgg 


gat 


acg 


gee 


ggg 


cag 


gag 


gac 


tac 


192 


Lys 


Pro 


He 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 




50 










55 










60 












gac 


cga 


ctg 


agg 


cca 


ctg 


tec 


tat 


ccc 


cag 


acc 


gat 


gtc 


ttc 


etc 


ate 


240 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin 


Thr 


Asp 


Val 


Phe 


Leu 


He 




65 










70 










75 










80 




tgc 


ttc 


teg 


ctg 


gtg 


aat 


ccg 


gca 


teg 


.ttc 


gag 


aac 


gtg 


egg 


gca 


aag 


288 


Cys 


Phe 


Ser 


Leu 


Val 


Asn 


Pro 


Ala 


Ser 


Phe 


Glu 


Asn 


Val 


Arg 


Ala 


Lys 










85 










90 










95 






tgg 


tat 


ccg 


gag 


gtg 


cgc 


cac 


cac 


tgc 


ccc 


age 


acg 


ccc 


ate 


ate 


ctg 


336 


Trp 


Tyr 


Pro 


Glu 
100 


Val 


Arg 


His 


His 


Cys 
105 


Pro 


Ser 


Thr 


Pro 


He 
110 


lie 


Leu 




gtg 


ggc 


acc 


aag 


ctg 


gat 


ttg 


cgc 


gac 


gac 


aag 


aac 


aca 


ate 


gaa 


aag 


384 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 


Asp Lys 


Asn 


Thr 


lie 


Glu 


Lys 








115 










120 










125 










ctg 


agg 


gac 


aag 


aaa 


ctg 


gcg 


ccc 


ate 


acc 


tat 


ccg 


cag 


ggc 


tct 


ggc 


432 


Leu 


Arg 


Asp 


Lys 


Lys 


Leu 


Ala 


Pro 


He 


Thr 


Tyr 


Pro 


Gin 


Gly 


Ser 


Gly 






130 








135 










140 












cat 


ggc 


aag 


gaa 


ate 


gga 


gcg 


gtc 


aag 


tat 


ctg 


gag 


tgc 


teg 


gee 


ctg 


480 


His 


Gly 


Lys 


Glu 


He 


Gly 


Ala 


Val 


Lys 


Tyr Leu 


Glu 


Cys 


Ser 


Ala 


.Leu 




145 








150 










155 










160 




acg 


cag 


aag 


ggt 


ctg 


aaa 


acc 


gtt 


ttc 


gae 


gag 


gee 


ate 


egg 


teg 


gtt 


528 


Thr 


Gin 


Lys 


Gly 


Leu 


Lys 


Thr 


Val 


Phe 


Asp 


Glu 


Ala 


He 


Arg 


Ser 


Val 










165 










170 










.175 






ttg 


tgc 


ccc 


gtg 


ctg 


cag 


ccc 


aag 


tec 


aag 


cgc 


aag 


tgc 


gee 


ctg 


etc 


* 576 


Leu 


Cys 


Pro 


Val 


Leu 


Gin 


Pro 


Lys 


Ser 


Lys 


Arg 


Lys 


Cys 


Ala 


Leu 


Leu 





180 185 190 

taa 579 
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96/291 



PCT7EP2004/008136 



<210> 118 
<211> 192 
<212> PRT 

<213> Drosophila melanogaster 

<400> 118 M 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 I 5 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn' Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ala 

35 40 45 

Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 ~ 70 75 80 

Cys Phe Ser Leu Val Asn Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

Trp Tyr Pro Glu Val Arg His His Cys Pro Ser Thr Pro He He Leu 

100 105 HO 

Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asn Thr He Glu Lys 

115 120 125 

Leu Arg Asp Lys Lys Leu Ala Pro He Thr Tyr Pro Gin Gly Ser Gly 

130 135 140 

His Gly Lys Glu He Gly Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ser Val 

165 170 175 

Leu Cys Pro Val Leu Gin Pro Lys Ser Lys Arg Lys Cys Ala Leu Leu 
180 185 190 

<210> 119 
<211> 579 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (579) 

<400> 119 AQ 

atg cag gcc ate aag tgt gtg gtt gtg ggc gac gga gcg gtg gga aag 48 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

acc tgt ctg ctg ate age tat acg ace aac gcc ttc ccc ggc gag tac 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 .30 

ata ccc acg gtg ttc gac aac tat teg gcg aat gtg atg gtg gat gcc 144 
He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ala 

35 40 45 

aag ccc ate aat ctg ggc etc tgg gat acg get gga cag gag gac tac 192 
Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gat cgc ctg agg ccg eta tec tat ccg caa acg gat gtc ttt etc ate 
Asd Ara Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

tgt ttc tea ctg gtg aat ccg gca teg ttt gag aat gtg cga gcc aaa 
Cvs Phe Ser Leu Val Asn Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

tgg ttt ccc gag gtg cgt cat cat tgc ccg agt gtg ccg ata ate ctg 336 
Trp Phe Pro Glu Val Arg His His Cys Pro Ser Val Pro He He Leu 

100 105 HO 

gtc ggc acc aaa ctg gat ctg cgc gac gat aag cag acg ate gag aag 384 
Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Thr He Glu Lys 

115 120 125 

ctg aag gac aag aag eta aca ccg ate acc tat ccc caa gga ctg gcg 432 

Leu Lys Asp Lys Lys Leu Thr Pro He Thr Tyr Pro Gin Gly Leu Ala 



240 
288 
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130 




135 








140 










atg 


gec 


aag 


gaa ata get gcg 


gtc 


aag 


tat ctg 


gag 


tgc teg 


gec 


ctg 


480 


Met 


Ala 


Lys 


Glu He Ala Ala 


Val 


Lys 


Tyr Leu 


Glu Cys Ser 


Ala 


Leu 




145 






150 




155 








160 




acc 


caa 


aag 


ggt ctg aag acg 


gtc 


ttc 


gac gag 


gec 


ata cga 


tec 


gtg 


528 


Thr 


Gin 


Lys 


Gly Leu Lys Thr 


Val 


Phe 


Asp Glu 


Ala 


He Arg 


Ser 


Val 










165 






170 






175 






eta 


tgt 


cct 


gtc gtt cga gga 


ccc 


aag 


egg cac 


aag 


tgc gec 


ctg 


etc 


576 


Leu 


Cys 


Pro 


val Val Arg Gly 


Pro 


Lys 


Arg His 


Lys 


Cys Ala 


Leu 


Leu 










180 




185 






190 








taa 






















579 



<210> 120 






















<211> 192 






















<212> PRT 






















<213> Drosophila melanogaster 












<400> 120 






















Met Gin 


Ala 


He 


Lys 


Cys 


Val 


Val 


Val 


Gly Asp 


Gly Ala Val 


Gly 


Lys 


1 






5 










10 




15 




Thr Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Thr 


Asn Ala 


Phe Pro Gly 


Glu 


Tyr 






20 










25 




30 






He Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Tyr 


Ser 


Ala Asn 


Val Met Val 


Asp 


Ala 




35 










40 






45 






Lys Pro 


He 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr Ala 


Gly Gin Glu 


Asp 


Tyr 


50 










55 








60 






Asp • Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin Thr 


Asp Val Phe 


Leu 


He 


65 








70 








75 






80 


Cys Phe 


Ser 


Leu 


Val 


Asn 


Pro 


Ala 


Ser 


Phe Glu 


Asn Val Arg 


Ala 


Lys 






85 










90 




95 




Trp Phe 


Pro 


Glu 


Val 


Arg 


His 


His 


Cys 


Pro Ser 


val Pro He 


He 


Leu 






100 










105 




110 






Val Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 


Asp Lys 


Gin Thr He 


Glu 


Lys 




115 










120 






125 






Leu Lys 


Asp 


Lys 


Lys 


Leu 


Thr 


Pro 


He 


Thr Tyr 


Pro Gin Gly 


Leu 


Ala 


130 










135 








140 






Met Ala 


Lys 


Glu 


He 


Ala 


Ala 


Val 


Lys 


Tyr Leu 


Glu Cys Ser 


Ala 


Leu 


145 






150 








155 






160 


Thr Gin 


Lys 


Gly 


Leu 


Lys 


Thr 


Val 


Phe 


Asp Glu 


Ala He Arg 


Ser 


Val 








165 










170 * 




175 




Leu Cys 


Pro 


Val 


val 


Arg 


Gly 


Pro 


Lys 


Arg His 


Lys Cys Ala 


Leu 


Leu 






180 










185 




190 







<210> 121 
<211> 913 
<212> DNA 

<213> Gossypium hirsutum (upland cotton) 

<220> 

<221> CDS 

<222> (12) . . (602) 

<400> 121 

gagaaaaaac a atg age act gca aga ttt ate aag tgt gtc acg gtc ggt 50 
Met Ser Thr Ala Arg Phe He Lys Cys Val Thr Val Gly 
1 5 10 

gat gga get gtg ggg aaa act tgt atg etc att tea tat acc age aat 98 
Asp Gly Ala Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn 

15 20 25 

act ttc cca acg gat tat gtt cca aca gta ttt gat aac ttt agt gec 146 
Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala 
30 35 40 45 

aat gtg gtg gtg gat ggc age aca gtg aac ctt ggc eta tgg gac act 194 
Asn Val Val Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr 
50 55 60 
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gcc ggg caa gaa gat tat aat agg eta agg cca ctg agt tat aga gga 242 
Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly 

65 "* 70 75 

get gat gtg ttt ttg ttg gcc ttt tct ctt ata age aag gcc agt tat 
Ala Asp Val Phe Leu Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr 

80 85 . 90 

gaa aac ate tac aaa aag tgg ate cca gag eta aga cat tat get cat 
Glu Asn lie Tyr Lys Lys Trp lie Pro Glu Leu Arg His Tyr Ala His 

95 * 100 105 

aat gta cca gtt gtg ctt gtt gga acc aaa* eta gat ttg cga gat gac 
Asn Val Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp 
110 H5 120 125 

aag cag ttc etc att gat cac cct gga gca aca cca ata tea aca tec 
Lys Gin Phe Leu lie Asp His Pro Gly Ala Thr Pro lie Ser Thr Ser 

130 135 140 

cag gga gaa gaa eta aag aag atg ata gga gca gtt act tat ata gaa 
Gin Gly Glu Glu Leu Lys Lys Met He Gly Ala Val Thr Tyr He Glu 

145 150 155 

tgc age tec aaa acc caa cag aat gtg aag get gtt ttc gat get gca 530 
Cys Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala 

160 165 170 

ata aaa gta get ttg agg cca cca aaa cca aag aga aag cct tgc aaa 578 
He Lys Val Ala Leu Arg Pro Pro Lys Pro Lys Arg Lys Pro Cys Lys 

175 180 185 

agg aga aca tgt get ttc ctt tgaatattgg atcattatta cagtcaaaaa 629 
Arg Arg Thr Cys Ala Phe Leu 
190 195 

cagttaacaa aagctgttgc agataaacac tgaatctget atagtttgtt tttggtttac 689 



290 
338 
386 
434 
482 



749 



atatgttcca cgtgaaacta tgaagcatct ctaagaaaac ccaaactatc atatcaaccc 
atcgatcaat gaatcgattt caattttege agtataagtt ccttttaatc ctttcttttt 809 
acttcatttt ataacgaatt ctatggataa tgttccctac aaacatgtca ttacaatgtt 869 



taattataaa ttccattctt ctattttact aaaaaaaaaa aaaa 913 



<210> 122 
<211> 196 
<212> PRT 

<213> Gossypium hirsutunl (upland cotton) 
<400> 122 

it an a atct Phe He Lvs Cys Val Thr Val Gly Asp Gly Ala 

10 15 
Ser Tyr Thr Ser Asn Thr Phe Pre 
25 30 
Asp Asn Phe Ser Ala Asn Val Va3 
45 

Gly Leu Trp Asp Thr Ala Gly Glr 
60 

Leu Ser Tyr Arg Gly Ala Asp Va] 
75 ~ 80 
Ser Lys Ala Ser Tyr Glu Asn lie 

90 95 
Arg His Tyr Ala His Asn Val Pr< 
105 HO 
Asp Leu Arg Asp Asp Lys Gin Ph< 
125 

Pro He Ser Thr Ser Gin Gly Gli 
140 



Met 


Ser 


Thr Ala Arg 
5 


Phe He 


Lys 


1 

Val 


Gly 


Lys Thr Cys 


Met Leu 


He 






20 






Thr 


Asp 


Tyr Val Pro 


Thr Val 


Phe 






35 




40 


Val 


Asp 


Gly Ser Thr 


Val Asn 


Leu 




50 




55 




Glu 


Asp 


Tyr Asn Arg 


Leu Arg 


Pro 


65 






70 




Phe 


Leu 


Leu Ala Phe 


Ser Leu 


He 






85 






Tyr Lys 


Lys Trp He 


Pro Glu 


Leu 






100 






Val 


Val 


Leu val Gly 


Thr Lys 


Leu 






115 




120 


Leu 


He 


Asp His Pro 


Gly Ala 


Thr 




130 




135 





WO 2005/014828 PCT/EP2004/008136 

99/291 

Glu Leu Lys Lys Met lie Gly Ala Val Thr Tyr lie Glu Cys Ser Ser 
145 150 155 160 

Lys Tlir Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala lie Lys Val 

165 170 175 

Ala Leu Arg Pro Pro Lys Pro Lys Arg Lys Pro Cys Lys Arg Arg Thr 

180 18S 190 

Cys Ala Phe Leu 
195 

<210> 123 
<211> 1067 
<212> DNA 

<213> Arabidopsis thai i ana (mouse-ear cress) 

<220> 

<221> CDS 

<222> (314) . . (907) 

<400> 123 

cttttctctc tctagttgtt gttctctctc tctctcgcat cctccaattc atcgtcctca 60 

cgttcccttt tgtttattca tcttccttcc ttcttcacat ttctgatttt ctctatttgg 120 

ggggtttgtt tccacattat acatatctag ggttttgaga tggttaattg aaagataatg 180 

gtcgaagttt cggaggaatt ggcttcacat tgtgggtgtt tctggcttcc tcaggtttaa 240 



atctgaggtt gatctctttt gttttttggt aaattgtgac atattttggc tcgaagaaga 300 



agaagaagag gca atg age gca tea agg ttc ata aag tgc gtc acc gtt 349 
Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val 
1 5 10 



ggt 


gat 


gga 


get 


gtt 


ggt 


aaa 


acc 


tgt 


ttg 


ctg 


att 


tct 


tat 


acc 


age 


397 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


lie 


Ser 


Tyr 


Thr 


Ser 








15 










20 










25 










aac 


acc 


ttt 


ccc 


acg 


gat 


tat 


gtt 


ccg 


act 


gtt 


ttc 


gat 


aac 


ttt 


agt 


445 


Asn 


Thr 


Phe 


Pro 


Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 






30 










35 










40 












gca 


aat 


gtg 


gtt 


gtc 


aat 


ggg 


gec 


acg 


gtg 


aat 


ctt 


gga 


ttg 


tgg 


gat 


493 


Ala 


Asn 


Val 


Val 


Val 


Asn 


Gly 


Ala 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 




45 










50 










55 










60 




act 


gca 


ggg 


caa 


gag 


gac 


tat 


aac 


agg 


tta 


aga* 


cct 


ttg 


agt 


tac 


cgt 


541 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 












65 










70 










75 






ggt 


get 


gat 


gtt 


ttc 


att 


ctt 


gee 


ttc 


tct 


etc 


att 


agt 


aag 


get 


agt 


589 


Gly 


Ala 


Asp 


Val 


Phe 


He 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Lys 


Ala 


Ser 






80 










85 










90 








tat 


gag 


aat 


gtt 


tec 


aag 


aag 


tgg 


att 


cca 


gag 


ttg 


aag 


cac 


tat 


get 


637 


Tyr 


Glu 


Asn 


Val 


Ser 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Lys 


His 


Tyr 


Ala 








95 










100 










105 










cct 


ggt 


gtc 


cca 


att 


gtc 


ctt 


gtt 


gga 


acc 


aaa 


eta 


gat 


ctt 


cga 


gat 


685 


Pro 


Gly 


val 


Pro 


lie 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 






110 










115 










120 












gac 


aaa 


cag 


ttt 


ttc 


ate 


gac 


cat 


cct 


ggt 


get 


gtc 


cct 


att 


acc 


act 


733 


Asp 


Lys 


Gin 


Phe 


Phe 


He 


Asp 


His 


Pro 


Gly 


Ala 


Val 


Pro 


He 


Thr 


Thr 




125 










130 










135 










140 




gtt 


cag 


gga 


gag 


gag 


ctg 


aag 


aag 


eta 


att 


gga 


gcg 


cca 


get 


tac 


ate 


781 


val 


Gin 


Gly 


Glu 


Glu 


Leu 


Lys 


Lys 


Leu 


He 


Gly 


Ala 


Pro 


Ala 


Tyr 


lie 












145 










150 










155 






gag 


tgc 


agt 


tea 


aaa 


tea 


caa 


gag 


aac 


gtg 


aag 


ggc 


gtg 


ttt 


gat 


gca 


829 


Glu 


Cys 


Ser 


Ser 


Lys 


Ser 


Gin 


GlU 


Asn 


val 


Lys 


Gly 


Val 


Phe 


Asp 


Ala 










160 










165 










170 








gcg 


ate 


aga 


gtg 


gtc 


ctt 


caa 


cct 


cca 


aag 


cag 


aag 


aaa 


aag 


aag 


aac 


877 
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Ala He Arg Val Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Asn 

175 180 185 

aaa gca caa aag gcc tgc tec ate ttg taatagcact catggaagtc 924 
Lys Ala Gin Lys Ala Cys Ser He Leu 

190 * 195 

aagaagctct ttgagatgag gatgacaggg tggtttaaaa aagtatcget ttttcatttt 984 

tgttgttgtt ttccgttttc atttgtgatc tcttagttcc gtacaaaacg aattgtgagg 1044 

ttaattcaac ttctatttta act 1067 



<210> 124 

<2ii>- i'97 
<212> PRT 

<213> Arabidopsis thaliana (mouse-ear cress) 

<400> 124 _ m ^, , 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
1 5 io 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 ^ 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 ' 70 75 80 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe He Asp His Pro Gly Ala Val Pro He Thr Thr Val Gin Gly Glu 

130 135 140 

Glu Leu Lys Lys Leu He Gly Ala Pro Ala Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Ser Gin Glu Asn Val Lys Gly Val Phe Asp Ala Ala He Arg Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Asn Lys Ala Gin Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210> 125 
<211> 695 
<212> DNA 

<213> Arabidopsis thaliana (mouse-ear cress) 

<220> 

<221> CDS 

<222> (1) . . (594) 

<400> 125 . . q 

atg age get teg agg ttc ata aag tgt gtc acc gtt ggc gac gga get 48 
Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 
5 10 15 



1 5 J-u 

gtt ggt aaa acc tgt ttg ctg att tct tac acc age aac act ttt cct 
Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

acg gat tat gta ccg act gtt ttc gat aac ttt age gca aat gtg gtt 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 



96 



144 
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gtt 


aat 


gga 


gec 


act 


gtg 


aat 


ctt 


crcrcr 


eta 


taa 

"33 


aat 


acc 


gca 


999 


cag 


192 


Val 


Asn 
50 


Gly 


Ala 


Thr 


Val 


Asn 
55 


Leu 


Gly 


Leu 


Trp 


Asp 
60 


Thr 


Ala 


Gly 


Gin 




gag 


gat 


tat 


aac 


aga 


tta 


aga 


cct 


ttg 

**3 


agt 


tac 


cgc 


ggt 


get 


gat 


gtt 


240 


Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 




65 










70 










75 










80 




ttc 


ate 


tta 


gca 


ttc 


tct 


ctt 


ate 


aat 


aag 


get 


agt 

. 3 


tat 


gag 


aat 


gtc 


288 


Phe 


lie 


Leu 


Ala 


Phe 
85 


Ser 


Leu 


He 


Ser 


Lys 
90 


Ala 


Ser 


Tyr 


Glu 


Asn 
95 


Val 




tec 


aag 


aag 


tgg 

w 3 3 


ate 


cca 


gag 


eta 


aaa 

****3 


cat 


tat 


gee 


cct 


ggt 

33 


gtc 


cct 


336 


Ser 


Lys 


Lys 


Trp 
100 


lie 


Pro 


Glu 


Leu 


Lvs 
105 


His 


Tvr 


Ala 


Pro 


GlV 

110 


Val 


Pro 




ata 


gtt 


ctt 


crtt 


gga 


ace 


aaa 


eta 


aat 


ctt 


cag 

w 33 


gat 


gac 


aaa 


cag 

** 3 


ttc 


384 


lie 


Val 


Leu 
115 


Val 


Gly 


Thr 


Lvs 


Leu 
120 


ASD 


Leu 


Ara 


Asp 


Asd 
125 


Lvs 


Gin 


Phe 




ttc 


att 


aac 


cac 


cct 


age 


crct 


ata 


cca 


att 


act 


act 


get 


caa 


gga 

33 s * 


aaa 

3^*3 


432 


Phe 


lie 


Asp 


His 


Pro 


Glv 


Ala 


Val 


Pro 


He 


Thr 


Thr 


Ala 


Gin 


GlV 


Glu 






130 








13 5 










140 












gaa 


ctg 


aag 


aaa 


eta 


att 


crcra 


get 


ccc 


gca 


tac 


ate 


gag 


tac 


agt 


tea 


480 


Glu 


Leu 


Lys 


Lys 


Leu 


lie 


Gly 


Ala 


Pro 


Ala 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 














150 










155 










160 




aaa 


aca 


caa 


gag 


aac 


gtg 


aaa 


gga 


gta 


ttt 


gat 


gca 


gcg 


ate 


cga 


gtg 


528 


Lys 


Thr 


Gin 


Glu 


Asn 


Val 


Lys 


Gly 


Val 


Phe 


Asp 


Ala 


Ala 


He 


Arg 


Val 










165 










170 










175 






gtt 


ctt 


caa 


cct 


cca 


aag 


cag 


aag 


aaa 


aag 


aaa 


age 


aaa 


gca 


caa 


aaa 


576 


Val 


Leu 


Gin 


Pro 
180 


Pro 


Lys 


Gin 


Lys 


Lys 
185 


Lys 


Lys 


Ser 


Lys 


Ala 
190 


Gin 


Lys 





gec tgc tec att ttg taatttctct acgcttctct taegtatetc tctctgtctc 631 
Ala Cys Ser lie Leu 
195 

tctgtctctt ccactcttct agtgaaggct taagaagaaa acacattggg cttaaaaatt 691 



gttc 695 



<210> 126 
<211> 197 
<212> PRT 

<213> Arabidopsis thaliana (mouse-ear cress) 
<400> 126 



Met 


ser 


Ala 


Ser 


Arg 


Phe 


He 


Lys 


Cys 


Val 


Thr 


Val 


Gly Asp Gly Ala 


1 








5 . 










10 






15 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Ser 


Asn Thr Phe Pro 








20 










25 








30 


Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala Asn Val Val 






35 










40 










45 


Val 


Asn 


Gly 


Ala 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr Ala Gly Gin 




50 










55 










60 




Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly Ala Asp Val 


65 










70 










75 




80 


Phe 


He 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Lys 


Ala 


Ser 


Tyr Glu Asn Val 










85 










90 






95 


Ser 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Lys 


His 


Tyr 


Ala 


Pro Gly Val Pro 








100 










105 








110 


He 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 


Asp Lys Gin Phe 






115 










120 










125 


Phe 


He 


Asp 


His 


Pro 


Gly 


Ala 


Val 


Pro 


He 


Thr 


Thr 


Ala Gin Gly Glu 




130 








135 










140 




Glu 


Leu 


Lys 


Lys 


Leu 


He 


Gly 


Ala 


Pro 


Ala 


Tyr 


He 


Glu Cys Ser Ser 


145 










150 










155 




160 


Lys 


Thr 


Gin 


Glu 


Asn 


Val 


Lys 


Gly 


Val 


Phe 


Asp 


Ala 


Ala He Arg Val 








165 










170 






175 


Val 


Leu 


Gin 


Pro 


Pro 


Lys 


Gin 


Lys 


Lys 


Lys 


Lys 


Ser 


Lys Ala Gin Lys 



180 185 190 



144 
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Ala Cys Ser lie Leu 
195 

<210> 127 
<211> 630 
<212> DNA 

<213> Arabidopsis thaliana (mouse-ear cress) 

<220> 
<221> CDS 
<222> (1) . . (630) 

<400> 127 

atg agt get teg aag ttc ata aaa tgt gtt act gtt gga gat ggg get 48 
Met Ser Ala Ser Lys Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 
15 10 15 

gtt ggg aag aca tgt atg ctt ate tgt tac act age aac aag ttt cct 96 
Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser Asn Lys Phe Pro 

20 25 30 

act gat tat ata ccg act gtg ttc gac aat ttc agt gec aat gta get 
Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Ala 

35 40 45 

gtg gat gga caa ate gtt aat tta ggg eta tgg gac act gec ggt caa 192 
Val Asp Gly Gin He Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

gaa gat tac agt agg tta aga cca ttg agt tat aga gga get gat ate 240 
Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp lie 
65 70 75 80 

ttc gtc tta gee ttt teg ctt att age aag gcg agt tac gaa aat gta 288 
Phe Val Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

etc aag aag tgg atg cct gaa ctt cgt egg ttt gcg cca aat gtt ccc 336 
Leu Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala Pro Asn Val Pro 

100 105 HO 

ata gtt ctt gtt ggt aca aag eta gat etc egg gat gac aag gga tac 384 
He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gly Tyr 

115 120 125 

etc gcg gat cac ace aat gtc att ace tct act cag gga gag gaa ttg 432 
Leu Ala Asp His Thr Asn Val He Thr Ser Thr Gin Gly Glu Glu Leu 

130 135 140 

agg aag caa att ggt gca get get tat att gag tgt agt tec aag act 480 
Arg Lys Gin He Gly Ala Ala Ala Tyr He Glu Cys Ser Ser Lys Thr 
145 150 155 160 

caa caa aat gtg aaa gca gtg ttt gat aca gcg ate aag gtg gtt ctt 528 
Gin Gin Asn Val Lys Ala Val Phe Asp Thr Ala He Lys Val Val Leu 

165 170 175 

cag cct cca agg agg aaa gag gtc ccg agg agg agg aag aat cat aga 576 
Gin Pro Pro Arg Arg Lys Glu Val Pro Arg Arg Arg Lys Asn His Arg 

180 185 190 

aga tec ggt tgc tec att gcg agt att gtc tgt gga ggt tgc ace get 624 
Arq Ser Gly Cys Ser He Ala Ser He Val Cys Gly Gly Cys Thr Ala 
195 200 205 

get taa 
Ala 



630 



<210> 128 
<211> 209 
<212> PRT 

<213> Arabidopsis thaliana (mouse-ear cress) 

Met°ser 2 Ala Ser Lys Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

15 io 15 

Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Ser Asn Lys Phe Pro 

20 25 30 

Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Ala 
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35 










40 45 


Val 


Asp 


Gly 


Gin 


He 


Val 


Asn 


Leu Gly Leu Trp Asp Thr Ala Gly Gin 




50 










55 


60 


Glu 


Asp 


Tyr 


Ser 


Arg 


Leu 


Arg 


Pro Leu Ser Tyr Arg Gly Ala Asp He 


65 










70 




75 80 


Phe 


Val 


Leu 


Ala 


Phe 


Ser 


Leu 


He Ser Lys Ala Ser Tyr Glu Asn Val 










85 






90 95 


Leu 


Lys 


Lys 


Trp 


Met 


Pro 


Glu 


Leu Arg Arg Phe Ala Pro Asn Val Pro 






- 


100 








105 110 


lie 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu Asp Leu Arg Asp Asp Lys Gly Tyr 






115 










120 125 


Leu 


Ala 


Asp 


His 


Thr 


Asn 


Val 


He Thr Ser Thr Gin Gly Glu Glu Leu 




130 










135 


140 


Arg 


Lys 


Gin 


He 


Gly 


Ala 


Ala 


Ala tyr He Glu Cys Ser Ser Lys Thr 


145 










150 




155 160 


Gin 


Gin 


Asn 


Val 


Lys 


Ala 


Val 


Phe Asp Thr Ala He Lys Val Val Leu 










165 






170 175 


Gin* 


Pro' 


Pro" 


Arg 


Arg 


Lys 


Glu 


Val Pro Arg Arg Arg Lys Asn His Arg 








180 








185 190 


Arg 


Ser 


Gly 


Cys 


Ser 


He 


Ala 


Ser He Val Cys Gly Gly Cys Thr Ala 






195 










200 205 



Ala 



<210> 129 
<211> 588 
<212> DNA 

<213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1) . • (588) 

<400> 129 



atg 


tct 


tea 


ccg 


teg 


agg 


cag 


ate 


aaa tgt gta gtt gtt gga gac gga 


48 


Met 


Ser 


Ser 


Pro 


Ser Arg 


Gin He Lys Cys Val Val Val Gly Asp Gly 




r 








5 








10 15 




aca 


gtt 


gga 


aaa 


aca 


tgc 


atg 


tta 


ata tct tac aca act gac tct ttt 


96 


Thr 


Val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


He Ser Tyr Thr Thr Asp Ser Phe 










20 










25 30 




cca 


gtt 


cag 


tat 


gtg 


cct 


aca 


gta 


ttt gat aac tat teg gca cag atg 


144 


Pro 


Val 


Gin 


Tyr 


Val 


Pro 


Thr 


Val 


Phe Asp Asn Tyr Ser Ala Gin Met 








35 










40 


45 




agt 


ctt 


gat 


ggg 


aac 


gtt 


gtg 


aac 


tta gga ttg tgg gat act get gga 


192 


Ser 


Leu 


Asp 


Gly 


Asn val 


Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 






50 










55 




60 




cag 


gag 


gat 


tat 


gat 


cgt 


tta 


cga 


cca ctt tec tac cca cag acg gat 


240 


Gin 


Glu 


Asp 


Tyr 


Asp Arg 


Leu Arg 


Pro Leu Ser Tyr Pro Gin Thr Asp 




65 










70 






75 80 




gtt 


ttc 


att 


etc 


tgc 


ttc 


tct 


gtc 


gtc teg ccc gta teg ttt gac aat 


288 


Val 


Phe 


He 


Leu 


Cys 


Phe 


Ser 


Val 


Val Ser Pro Val Ser Phe Asp Asn 












85 








90 95 




gtg 


gca 


age 


aag 


tgg att 


ccg 


gaa 


ata cga cag cat tgt cca gat gcg 


336 


Val 


Ala 


Ser 


Lys 


Trp 


He 


Pro 


Glu 


He Arg Gin His Cys Pro Asp Ala 










100 










105 110 




cct 


gtc 


att 


eta 


gtt 


ggt 


ace 


aaa 


etc gat ttg cgc gac gag gee gaa 


384 


Pro 


Val 


He 


Leu 


Val Gly 


Thr 


Lys 


Leu Asp Leu Arg Asp Glu Ala Glu 








115 










120 


125 




ccg 


atg 


cgt 


get 


ctg cag 


gee 


gaa 


gga aag tec cca att tec aaa acg 


432 


Pro 


Met 


Arg 


Ala 


Leu 


Gin 


Ala Glu Gly Lys Ser Pro He Ser Lys Thr 






130 










135 




140 




caa 


ggc 


atg 


aaa 


atg 


get 


caa 


aaa 


att aaa get gtc aag tat ttg gaa 


480 


Gin 


Gly 


Met 


Lys 


Met 


Ala 


Gin; 


Lys 


He Lys Ala Val Lys Tyr Leu Glu 




145 










150 






155 ." 160 




tgc 


tct 


gca 


ttg 


acg 


caa 


cag 


gga 


etc aca cag gtg ttc gaa gac gec 


528 


Cys 


Ser 


Ala 


Leu 


Thr 


Gin 


Gin Gly Leu Thr Gin Val Phe Glu Asp Ala 












165 








170 175 




gta 


egg 


tec 


att 


ctt 


cat 


ccg 


aaa 


cca cag aaa aag aag aag tct tgc 


576 
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Val Arg Ser lie Leu His Pro Lys Pro Gin Lys Lys Lys Lys Ser Cys 

180 185 190 

aat att atg taa 588 
Asn lie Met 
195 

<210> 130 
<211> 195 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 130 

Met Ser Ser Pro Ser Arg Gin He Lys Cys Val Val Val Gly Asp Gly 

1 5 " 10 15 

Thr Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Thr Asp Ser Phe 

20 25 30 

Pro Val Gin Tyr Val Pro Tnr Val Phe Asp Asn Tyr Ser Ala Gin Met 

35 40 45 

Ser Leu Asp Gly Asn Val Val Asn Leu Gly Leu Trp Asp Tnr Ala Gly 

50 ~ 55 60 

Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp 
65 70 75 80 

Val Phe He Leu Cys Phe Ser Val Val Ser Pro Val Ser Phe Asp Asn 

85 90 95 

Val Ala Ser Lys Trp He Pro Glu He Arg Gin His Cys Pro Asp Ala 

100 105 HO 

Pro Val He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Glu Ala Glu 

115 120 125 

Pro Met Arg Ala Leu Gin Ala Glu Gly Lys Ser Pro He Ser Lys Thr 

130 135 140 

Gin Gly Met Lys Met Ala Gin Lys He Lys Ala Val Lys Tyr Leu Glu 
145 ■ 150 155 160 

Cys Ser Ala Leu Thr Gin Gin Gly Leu Thr Gin Val Phe Glu Asp Ala 

165 . 170 175 

Val Arg Ser He Leu His Pro Lys Pro Gin Lys Lys Lys Lys Ser Cys 
180 185 190 

Asn He Met 
195 

<210> 131 
<211> 843 
<212> DNA 

<213> Arabidopsis thaliana (mouse-ear cress) 

<220> 

<221> CDS 

<222> (90) . . (683) 

acgcgtcgag aaactccaat tccaaaggct aaatctttga gatctttttt tttataaatt 60 

tctctgaaat taataaactt tgaggggaa atg age get teg agg ttc gta aag 113 

Met Ser Ala Ser Arg Phe Val Lys 
1 5 

tgc gtg acg gtt ggt gat gga get gtc gga aaa act tgt ttg ttg att 161 
Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He 

10 15 20 

tct tac aca age aac act ttc cct acg gat tat gtg cct ace gtt ttc 
Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe 
25 30 35 40 

gat aat ttc agt gec aat gtt gtg gtt aat gga age act gtg aat ctt 
Asp Asn Phe Ser Ala Asn Val Val Val Asn Gly Ser Thr Val Asn Leu 

45 50 55 

gga ttg tgg gac act gca ggg caa gag gat tac aat aga tta aga cca 
Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro 
60 65 70 



209 
257 
305 
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ctg 


agt 


tac 


cgt 


gga 


gca 


gat 


gtt 


ttc 


att 


ttg 


gcc 


ttc 


tct 


ctt 


ate 


353 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


lie 


Leu 


Ala 


Phe 


Ser 


Leu 


He 








75 










80 










85 










agt 


aaa 


gcc 


agt 


tat 


gaa 


aac 


gtc 


tec 


aaa 


aag 


tgg 


ate 


ccg 


gag 


ttg 


401 


Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


Val 


Ser 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 






90 










95 










100 












aaa 


cat 


tac 


gcg 


cct 


ggt 


gtc 


ccc 


ate 


gtc 


ctt 


gtt 


gga 


aca 


aag 


ctt 


449 


Lys 


His 


Tyr 


Ala 


Pro 


Gly 


Val 


Pro 


lie 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 




105 










110 










115 










120 




gat 


ctt 


cga 


gat 


gat 


aaa 


cag 


ttc 


ttt 


ate' 


gac 


cat 


cct 


ggt 


get 


gtt 


497 


Asp 


Leu 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 


Phe 


lie 


Asp 


His 


Pro 


Gly 


Ala 


Val 












12S 










130 










135 






ccg 


att 


act 


act 


get 


cag 


gga 


gag 


gag 


ctg 


agg 


aag 


caa 


ata 


gga 


gca 


545 


Pro 


lie 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 


Arg 


Lys 


Gin 


He 


Gly 


Ala 










140 










145 










150 








cct 


act 


tac 


ate 


gaa 


tgc 


agt 


tec 


aaa 


act 


caa 


gag 


aat 


gtg 


aag 


gcg 


593 


Pro 


Thr 


Tyr 


lie 


Glu 


Cys 


Ser 


Ser 


Lys 


Thr 


Gin 


Glu 


Asn 


val 


Lys 


Ala 








155 








160 










165 








gtg 


ttt 


gac 


gca 


gcc 


ate 


cga 


gtg 


gtg 


ttg 


caa 


ccg 


cca 


aag 


cag 


aag 


641 


Val 


Phe 


Asp 


Ala 


Ala 


lie 


Arg 


Val 


Val 


Leu 


Gin 


Pro 


Pro 


Lys 


Gin 


Lys 






170 










175 










180 












aag 


aag 


aag 


age 


aaa 


gcg 


cag 


aag 


gca 


tgc 


tec 


att 


eta 


tgatgattgg 


690 


Lys 


Lys 


Lys 


Ser 


Lys 


Ala 


Gin 


Lys 


Ala 


Cys 


Ser 


lie 


Leu 










185 










190 










195 















aaatctctgt ttttatgtat ttggttttgg tatattaatc ttatatcaat gaatgaatta 75 0 
atgtgttaat ggacagacac ccaagtttga ctggtccttt ttgttcttaa tattaatgga 810 
gtttgtcgga aaaaaaaaaa aaaaaaaaaa aaa 843 



<210> 132 
<211> 197 
<212> PRT 

<213> Arabidopsis thaliana (mouse-ear cress) 



<400> 132 



Met 


Ser 


Ala 


Ser 


Arg 


Phe 


Val 


Lys 


Cys 


Val 


Thr 


Val Gly Asp Gly Ala 


1 








5 








10 




15 


Val 


Gly 


Lys 


Thr 
20 


Cys 


Leu 


Leu 


He 


Ser 
25 


Tyr 


Thr 


Ser Asn Thr Phe Pro 
30 


Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser Ala Asn Val Val 




35 










40 








45 


Val 


Asn 
50 


Gly 


Ser 


Thr 


Val 


Asn 
55 


Leu 


Gly 


Leu 


Trp 


Asp Thr Ala Gly Gin 
60 


Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg Gly Ala Asp Val 


65 










70 










75 


80 


Phe 


He 


Leu 


Ala 


Phe 
85 


Ser 


Leu 


He 


Ser 


Lys 
90 


Ala 


Ser Tyr Glu Asn Val 
95 


Ser 


Lys 


Lys 


Trp 
100 


He 


Pro 


Glu 


Leu 


Lys 
105 


His 


Tyr 


Ala Pro Gly Val Pro 
110 


He 


Val 


Leu 
115 


Val 


Gly 


Thr 


Lys 


Leu 
120 


Asp 


Leu 


Arg 


Asp Asp Lys Gin Phe 
125 


Phe 


He 


Asp 


His 


Pro 


Gly 


Ala 


Val 


Pro 


He 


Thr 


Thr Ala Gin Gly Glu 




13 0 








135 










140 


Glu 


Leu 


Arg 


Lys 


Gin 


He 


Gly 


Ala 


Pro 


Thr 


Tyr 


He Glu Cys Ser Ser 


145 








150 










155 


160 


Lys 


Thr 


Gin 


Glu 


Asn 


Val 


Lys 


Ala 


Val 


Phe 


Asp 


Ala Ala He Arg Val 








165 










170 




175 


Val 


Leu 


Gin 


Pro 
180 


Pro 


Lys 


Gin 


Lys 


Lys 
185 


Lys 


Lys 


Ser Lys Ala Gin Lys 
190 


Ala 


Cys 


Ser 
195 


He 


Leu 

















150 
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<210> 133 
<211> 841 
<212> DNA 

<213> Arabidopsis thaliana (mouse-ear cress) 

<220> 

<221> CDS 

<222> (28).. (633) 

<400> 133 

aaggaattga agacaaaagc aaagaaa atg age aca gca aga ttc att aag tgt 54 
- ^ Met Ser Thr Ala Arg Phe He Lys Cys 

1 5 

gtg act gtc gga gat gga gca gtg gga aag act tgt atg etc att tea 102 
Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu He Ser 
10 15 20 25 

tat acc age aat acg ttt cct acg gat tat gtt cca aca gtt ttc gac 
' Tyr Thr Ser Asn Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp 
30 35 40 

aac ttc age gca aat gtg gtg gtc gac ggg agt acc gtg aac ctt ggc 198 
Asn Phe Ser Ala Asn Val Val Val Asp Gly Ser Thr Val Asn Leu Gly 

45 50 55 

ctg tgg gat act gec ggt cag gaa gat tat aat agg ctt agg cct ttg 246 
Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu 

60 65 70 

agt tac aga gga gca gat gtc ttc tta tta gca ttt tec ctt ata age 294 
Ser Tyr Arg Gly Ala Asp Val Phe Leu Leu Ala Phe Ser Leu He Ser 

75 80 85 

aag gee agt tac gag aat att cac aaa aag tgg ctt ccg gag ctg aaa 342 
Lys Ala Ser Tyr Glu Asn He His Lys Lys Trp- Leu Pro Glu Leu Lys 
90 - 95 100 105 

cat tat get cct ggc ate ccc att gtg etc gtc gga aca aaa tta gat 390 
His Tyr Ala Pro Gly He Pro He Val Leu Val Gly Thr Lys Leu Asp 

110 H5 120 

ttg agg gat gac aag cag ttc ttg aag gat cat cca gga gca get tct 438 
Leu Arg Asp Asp Lys Gin Phe Leu Lys Asp His Pro Gly Ala Ala Ser 

125 130 135 

ata aca act get cag gga gaa gaa tta agg aaa atg att gga get gtt 486 
He Thr Thr Ala Gin Gly Glu Glu Leu Arg Lys Met He Gly Ala Val 

140 145 150 

agg tac tta gag tgc age tec aaa acc caa cag aat gtg aag gca gtg 534 
Arg Tyr Leu Glu Cys Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val 

155 160 165 

ttt gat aca gcg ata agg gta get ttg agg cca cca aag gca aag aaa 
Phe Asp Thr Ala He Arg val Ala Leu Arg Pro Pro Lys Ala Lys Lys 
170 175 180 185 

aag ata aaa cca ttg aag act aag aga tea aga ata tgc ttt ttc eta 
Lys He Lys Pro Leu Lys Thr Lys Arg Ser Arg He Cys Phe Phe Leu 

190 195 200 

taatatctga acagatttca ataaataaca aacatttata tatacttgta ttttgatatt 690 

tttttgaata ttttctactt gaaatagtgg actttgette ateaegttat tggcgagagt 750 

attgtcgaat catgttacca ttattcggga gaaatttgta aaatagtagt aatttggcaa 810 

gaaaattgag gattgatatt ttttccccat a 841 



<210> 134 
<211> 201 
<212> PRT 

<213> Arabidopsis thaliana (mouse-ear cress) 



582 



630 
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<400> 134 



Met Ser 


Thr 


Ala 


Arg 


Phe 


He 


Lys 


Cys 


Val 


Thr 


Val 


Gly 


Asp 


Gly 


Ala 




1 






5 










10 










15 






Val Gly 


Lys 


Thr 


Cys 


Met 


Leu 


He 


Ser 


Tyr 


Thr 


Ser 


Asn 


Thr 


Phe 


Pro 








20 










25 










30 








Thx Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


Val 






35 










40 








45 










Val Asp 


Gly 


Ser 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 




50 










55 










60 












Glu Asp 


Tyr 


Asn 


Arg 


Leu 


Arg Pro 


Leu 


Ser 


Tyr 


Arg 


Gly' 


Ala 


Asp 


val 




65 








70 










75 










80 




Phe Leu 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


He 










85 










90 










95 






His Lys 


Lys 


Trp 


Leu 


Pro 


Glu 


Leu 


Lys 


His 


Tyr 


Ala 


Pro 


Gly 


He 


Pro 








100 










105 










110 








lie val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 






115 










120 










125 




■ 






Leu Lys 


Asp 


His 


Pro 


Gly 


Ala Ala 


Ser 


He 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu * 




130 










135 










140 












Glu Leu 


Arg 


Lys 


Met 


He 


Gly Ala 


Val 


Arg 


Tyr 


Leu 


Glu 


Cys 


Ser 


Ser 




145 








150 










155 










16 0 




Lys Thr 


Gin 


Gin 


Asn 


Val 


Lys 


Ala 


Val 


Phe 


Asp 


Thr 


Ala 


He 


Arg 


Val 










165 










170 










175 






Ala Leu 


Arg 


Pro 


Pro 


Lys 


Ala 


Lys 


Lys 


Lys 


He 


Lys 


Pro 


Leu 


Lys 


Thr 








180 










185 










190 








Lys Arg 


Ser 


Arg 


lie 


Cys 


Phe 


Phe 


Leu 




















195 










200 




















<210> 135 






























<211> 784 






























<212> DNA 






























<213> Arabidopsis thaliana 


(mouse -ear cress) 












<220> 
































<221> CDS 






























<222> (1) . . 


(591) 


























<400> 135 






























atg agt 


get 


teg 


agg 


ttt 


ata 


aag 


tgt 


gtc 


acc 


gtc 


ggc 


gat 


ggt 


gcc 


48 


Met Ser 


Ala 


Ser 


Arg 


Phe 


He 


Lys 


Cys 


Val 


Thr Val 


Gly 


Asp 


Gly 


Ala 




1 






5 










10 










15 






gtc gga 


aaa 


act 


tgt 


atg 


ctg 


att 


tct 


tac 


aca 


age 


aac 


act 


ttc 


cct 


96 


Val Gly 


Lys 


Thr 


Cys 


Met' 


Leu 


He 


Ser 


Tyr 


Thr Ser 


Asn 


Thr 


Phe 


Pro 








20 










25 










30 








acg gac 


tat 


gtt 


cca 


act 


gtt 


ttc 


gac 


aac 


ttc 


agt 


get 


aat 


gtg 


gtt 


144 


Thr Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


Val 




35 










40 








45 










gta gat 


ggg 


aac 


acg 


gtg 


aat 


ctt 


gga 


ttg 


tgg 


gat 


aca 


get 


ggt 


caa 


192 


Val Asp 


Gly 


Asn 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 




50 










55 










60 












gaa gac 


tat 


aac 


agg 


tta 


aga 


ccg 


ttg 


agt 


tac 


cgt 


ggt 


gcc 


gat 


gtc 


240 


Glu Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 




65 








70 










75 










80 




ttc att 


ctt 


gca 


ttc 


teg 


ctt 


att 


age 


aaa 


get 


age 


tac 


gag 


aat 


gta 


288 


Phe lie 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


val 










85 










90 










95 






gcc aag 


aag 


tgg 


att 


cct 


gag 


ctt 


agg 


cat 


tat 


gcc 


cct 


ggt 


gtt 


cct 


336 


Ala Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Arg 


His 


Tyr 


Ala 


Pro 


Gly 


Val 


Pro 








100 










105 










110 








ata ate 


etc 


gtt 


gga 


acg 


aaa 


etc 


gat 


ctt 


cga 


gat 


gac 


aag 


caa 


ttc 


384 


lie lie 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 






115 










120 










125 










ttc ata 


gac 


cat 


cct 


ggt 


gca 


gtg 


cct 


att 


act 


aca 


aac 


cag 


gga 


gag 


432 


Phe lie 


Asp 


His 


Pro 


Gly 


Ala 


Val 


Pro 


He 


Thr 


Thr 


Asn 


Gin 


Gly 


Glu 




13 0 








135 










140 












gaa eta 


aag 


aaa 


ctg 


ata 


gga 


tea 


cca 


ate 


tac 


att 


gaa 


tgt 


agt 


tea 


480 


Glu Leu 


Lys 


Lys 


Leu 


He 


Gly 


Ser 


Pro 


He 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 




145 








150 










155 










160 
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aag act cag cag aat gtg aaa gca gtc ttt gac gca gcc ata aaa gtg 528 
I*ys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

gtg ctt cag cca ccg aaa cag aag aag aag aaa aag aac aag aac cgc 576 
Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Lys Asn Lys Asn Arg 

180 185 190 

tgc gtg ttc ttg tgatcgaaca tctcttaaaa cgaaaaaagg tttaggtaac 628 
Cys Val Phe Leu 
195 

aaaagaagct gaaggaaaac gaacacctgc aacatt'gtat agttgttgaa tccggcttgt 688 
ttcggtcttt actcttcttt ttgctggtga atctaatgag tgaactgagg ccaagaggag 748 
gattctgggt aagagaaggg gatgtgttct tttctt 784 



<210> 136 
<211> 196 
<212> PRT 

<213> Arabidopsis thaliana (mouse-ear cress) 
<400> 136 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

15 10 15 

Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asp Gly Asn Thr Val Asn Leu Gly Leu Tip Asp Thr Ala Gly Gin 

50 55 60. 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 ^ 70 75 80 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 " 120 125 

Phe He Asp His Pro Gly Ala Val Pro He Thr Thr Asn Gin Gly Glu 

130 " 135 140 

Glu Leu Lys Lys Leu He Gly Ser Pro He Tyr He Glu Cys Ser Ser 
145 ~ " 150 155 160 

Lys Thr Gin Gin Asn Val . Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Lys Asn Lys Asn Arg 

180 185 190 

Cys Val Phe Leu 
195 

<210> 137 
<211> 679 
<212> DNA 

<213> Arabidopsis thaliana (mouse-ear cress) 

<220> 

<221> CDS 

<222> (1) . . (648) 



<400> 137 

atg get tea agt get tea aag tte ate aag tgt gtg act gtt ggt gat 
Met Ala Ser Ser Ala Ser Lys Phe He Lys Cys Val Thr Val Gly Asp 
1 5 10 15 

ggt get gtt ggt aaa ace tgt atg etc ate tgc tac acc age aat aaa 
Gly Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Ser Asn Lys 



48 



96 
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20 








25 










30 








ttc 


ccc 


act 


gac 


tac ata 


cca 


aca 


gtt 


ttt 


gac 


aac 


ttt 


agt 


gca 


aat 


144 


Phe 


Pro 


Thr Asp 


Tyr He 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 








35 








40 










45 










gtt 


gtt 


gtt 


gaa 


ggc acc 


act 


gtc 


aat 


ttg 


ggg 


ctt 


tgg 


gac 


act 


get 


192 


Val 


Val 


val 


Glu 


Gly Thr Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 






50 








55 










60 












ggg 


caa 


gaa 


gac 


tat aac 


aga 


tta 


agg 


cct 


tta 


agt 


tac 


agg 


gga 


gca 


240 


Gly 


Gin 


Glu Asp 


Tyr Asn Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 




65 








70 










75 










80 




gat 


gtt 


ttc 


crtc 


ttg tct 


ttc 


tea 


tta 


gtc 


age 


cga 


get 


age 


tac 


gag 


288 


Asp 


val 


Phe 


Val 


Leu Ser 


Phe 


Ser 


Leu 


Val 


Ser 


Arg 


Ala 


Ser 


Tyr 


Glu 










85 








90 










95 






aat 


gtt 


ttt 


aaa 


aacr tcrcr 


ate 


cct 


gaa 


etc 


caa 


cac 


ttt 


get 


cca 


gga 


336 


Asn 


Val 


Phe 


Lys 
100 


Lvs Trp 


He 


Pro 


Glu 
105 


Leu 


Gin 


His 


Phe 


Ala 
110 


Pro 


Gly 




gtt 


ccc 


ctt 


gtc 


ctt gtt 


crcrt 


acc 


aaa 


tta 


gat 


ctt 


cgt 


gaa 


gat 


aag 


384 


Val 


Pro 


Leu 
115 


val 


Leu Val 


Glv 


Thr 
120 


Lys 


Leu 


Asp 


Leu 


Arg 
125 


Glu 


Asp 


Lys ' 




cat 


tat 


ttg 


get 


gat cat 


cct 


gga 


eta 


tec 


cct 


gta 


act 


act 


gca 


cag 


432 


His 


Tyr 
130 


Leu 


Ala 


Asp His 


Pro 
135 


Gly 


Leu 


Ser 


Pro 


Val 
140 


Thr 


Thr 


Ala 


Gin 




gga 


gag 


gag 


ttg 


cgt aag 


eta 


att 


ggt 


gcg 


acg 


tat 


tac 


att 


gag 


tgt 


480 


Gly 


Glu 


Glu 


Leu 


Arg Lys 


Leu 


He 


Gly 


Ala 


Thr 


Tyr 


Tyr 


He 


Glu 


Cys 




145 








150 










155 










160 




agt 


tea 


aaa 


act 


caa cag 


aat 


gtg 


aaa 


gca 


gtt 


ttt 


gat 


tct 


gcg 


ata 


528 


Ser 


Ser 


Lys 


Thr 


Gin Gin 


Asn 


Val 


Lys 


Ala 


Val 


Phe 


Asp 


Ser 


Ala 


He 










165 








170 










175 






aag 


gaa 


gtg ate 


aaa cct 


ctg 


gtt 


aaa 


caa 


aag 


gag 


aag 


act 


aag 


aag 


576 


Lys 


Glu 


Val 


lie 


Lys Pro 


Leu 


Val 


Lys 


Gin 


Lys 


Glu 


Lys 


Thr 


Lys 


Lys 








180 








185 










190 








aag 


aag 


aag 


caa 


aag teg 


aat 


cac 


ggc 


tgt 


tta 


tea 


aat 


gtt 


ctg 


tgt 


624 


Lys 


Lys 


Lys 


Gin 


Lys Ser 


Asn 


His 


Gly 


Cys 


Leu 


Ser 


Asn 


Val 


Leu 


Cys 






195 








200 










205 










ggg 


agg 


ata 


gtg 


act egg 


cat 


tgatgacgat gacccaactc agtctgatga 


675 


Gly 


Arg 
210 


He 


Val 


Thr Arg 


His 
215 























tttt 679 



<210> 138 
<211> 215 
<212> PRT 

<213> Arabidopsis thaliana (mouse-ear cress) 



<400> 138 



Met 


Ala 


Ser 


Ser 


Ala 


Ser 


Lys 


Phe 


He 


Lys 


Cys 


Val 


Thr 


Val 


Gly 


Asp 


1 








5 










10 










15 




Gly 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


He 


Cys 


Tyr 


Thr 


Ser 


Asn 


Lys 






20 










25 










30 






Phe 


Pro 


Thr 


Asp 


Tyr 


He 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 






35 






40 










45 








Val 


Val 


Val 


Glu 


Gly 


Thr 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 




50 








55 










60 










Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


65 








70 








75 










80 


Asp 


Val 


Phe 


val 


Leu 


Ser 


Phe 


Ser 


Leu 


Val 


Ser 


Arg 


Ala 


Ser 


Tyr 


Glu 








85 










90 










95 




Asn 


Val 


Phe 


Lys 
100 


Lys 


Trp 


He 


Pro 


Glu 
105 


Leu 


Gin 


His 


Phe 


Ala 
110 


Pro 


Gly 


Val 


Pro 


Leu 


Val 


Leu 


Val 


Gly Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Glu 


Asp 


Lys 






115 










120 










125 








His 


Tyr 
130 


Leu 


Ala 


Asp 


His 


Pro 
135 


Gly 


Leu 


Ser 


Pro 


Val 
140 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 


Arg 


Lys 


Leu 


He 


Gly 


Ala 


Thr 


Tyr 


Tyr 


He 


Glu 


Cys 



145 150 155 160 
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Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ser Ala He 

165 170 175 

Lys Glu Val He Lys Pro Leu Val Lys Gin Lys Glu Lys Thr Lys Lys 

180 185 190 

Lys Lys Lys Gin Lys Ser Asn His Gly Cys Leu Ser Asn Val Leu Cys 
195 200 205 



144 



195 200 
Gly Arg He Val Thr Arg His 
210 215 

<210> 139 
<211> 630 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 
<221> CDS 
<222> (1) . . (630) 

<400> 139 ^ . q 

atg tea caa caa gtt ggt aac agt ate aga aga aag ctg gta ate gtt 4 a 

Met Ser Gin Gin Val Gly Asn Ser He Arg Arg Lys Leu Val He Val 
15 10 15 

ggt gat ggt gee tgt ggt aag aca tgt tta tta ate gtc ttt tec aag 96 
Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys 

20 25 30 

ggc caa ttt cca gaa gtc tac gta cca act gtc ttt gaa aac tat gta 
Gly Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val 

35 40 45 

gca gat gtt gaa gtt gat ggg cgt cgt gta gag eta gcg eta tgg gat 192 
Ala Asp Val Glu Val Asp Gly Arg Arg Val Glu Leu Ala Leu Trp Asp 

50 55 60 

ace get ggt caa gaa gat tat gat aga eta aga cca ttg tea tac cca 240 

Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro 

65 70 75 80 

gac tec aat gtc gta tta att tgt ttc tct ate gat ctt cca gat tct 288 

Asp Ser Asn Val Val Leu He Cys Phe Ser lie Asp Leu Pro Asp Ser 

85 90 95 

tta gag aat gta caa gaa aaa tgg att gee gaa gta tta cat ttc tgt 336 

Leu Glu Asn Val Gin Glu Lys Trp He Ala Glu Val Leu His Phe Cys 

100 105 HO 

caa ggt gtg cca att att ctt gtt ggt tgt aaa gtg gat ttg aga aac 384 

Gin Gly Val Pro He He Leu Val Gly Cys Lys Val Asp Leu Arg Asn 

115 120 125 

gac cca caa ace att gaa caa tta aga caa gaa ggt caa caa ccc gtt 432 

Asp Pro Gin Thr He Glu Gin Leu Arg Gin Glu Gly Gin Gin Pro Val 

130 135 140 

aca tea cag gag gga caa tct gta gca gac cag att ggc gca ace gga 480 

Thr Ser Gin Glu Gly Gin Ser Val Ala Asp Gin He Gly Ala Thr Gly 

145 150 155 160 

tac tac gaa tgt teg gee aag act ggt tat ggt gtc aga gaa gtg ttt 

Tyr Tyr Glu Cys Ser Ala Lys Thr Gly Tyr Gly Val Arg Glu Val Phe 

165 170 175 

gag gee gee act aga get tea ttg atg ggt aaa tct aaa acg aat ggt 576 

Glu Ala Ala Thr Arg Ala Ser Leu Met Gly Lys Ser Lys Thr Asn Gly 

180 185 190 

aaa get aag aag aac act act gaa aag aag aag aag aag tgt gtc ttg 624 

Lys Ala Lys Lys Asn Thr Thr Glu Lys Lys Lys Lys Lys Cys Val Leu 

195 200 205 

tta tag 
Leu 



528 



630 



<210> 140 
<211> 209 
<212> PRT 

<213> Saccharomyces cerevisiae 
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<400> 140 




























Met 


Ser 


Gin 


Gin 


Val 


Gly 


Asn 


Ser 


He 


Arg 


Arg 


Lys 


Leu 


Val 


He 


Val 


1 








5 










10 










15 




Gly 


Asp 


Gly Ala 


Cys 


Gly 


Lys 


Thr 


Cys 


Leu 


lieu 


He 


Val 


Phe 


Ser 


Lys 








20 










25 










30 






Gly 


Gin 


Phe 


Pro 


Glu 


Val 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Glu Asn 


Tyr 


Val 




35 










40 










45 






Ala 


Asp 
50 


Val 


Glu 


Val 


Asp 


Gly 
55 


Arg 


Arg 


Val 


Glu 


Leu 
60 


Ala 


Leu 


Trp 


Asp 


Thr 


Ala 


GlV 


Gin 


Glu 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


65 










70 










75 










80 


Asp 


Ser 


Asn 


Val 


Val 
85 


Leu 


He 


Cys 


Phe 


Ser 
90 


He 


Asp 


Leu 


Pro 


Asp 
95 


Ser 


Leu 


Glu 


Asn 


Val 


Gin 


Glu 


Lys 


Trp 


He 


Ala 


Glu 


Val 


Leu 


His 


Phe 


Cys 








100 










105 










110 




Gin 


Gly 


Val 
115 


Pro 


lie 


He 


Leu 


Val 
120 


Gly 


Cys 


Lys 


Val 


Asp 
125 


Leu 


Arg 


Asn 


Asp 


Pro 
130 


Gin 


Thr 


lie' 


Glu 


Gin 
135 


"Leu 


Arg 


Gin 


Glu 


Gly 
140 


Gin 


Gin 


Pro 


Val 


Thr 


Ser 


Gin 


Glu 


Gly 


Gin 


Ser 


Val 


Ala 


Asp 


Gin 


He 


Gly Ala 


Thr 


Gly 


145 










150 










155 










160 


Tyr 


Tyr 


Glu 


Cys 


Ser 
165 


Ala 


Lys 


Thr 


Gly 


Tyr 
170 


Gly 


Val 


Arg 


Glu 


Val 
175 


Phe 


Glu 


Ala 


Ala 


Thr 
180 


Arg 


Ala 


Ser 


Leu 


Met 
185 


Gly 


Lys 


Ser 


Lys 


Thr 
190 


Asn 


Gly 


Lys 


Ala 


Lys 
195 


Lys 


Asn 


Thr 


Thr 


Glu 
200 


Lys 


Lys 


Lys 


Lys 


Lys 
205 


Cys 


Val 


Leu 


Leu 

































<210> 141 
<211> 987 
<212> DNA 

<213> Aplysia cal if orriica (California sea hare) 

<220> 

<221> CDS 

<222> (14) . . (592) 

<400> 141 

ccaagtgcca acc atg gca gcg ata cga aag aag ctt gtt ata gtc gga 49 
Met Ala Ala He Arg Lys Lys Leu Val He Val Gly 
1 5 10 

gat ggt gcg tgt ggt aaa aca tgt eta ctt att gtc ttc age aaa gac 97 
Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu lie Val Phe Ser Lys Asp 

15 20 25 

cag ttc cct gaa gtt tac gtg cca aca gtt ttt gaa aat tat gta gca 145 
Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala 

30 35 40 

gac att gaa gtt gat ggc aaa cag gtt gag eta get ctg tgg gac aca 193 
Asp He Glu Val Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr 
45 50 55 60 

gcg gga caa gag gac tat gac aga ctg agg ccg ctg tct tac cct gac 241 
Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp 

65 70 75 

aca gat gtc ate etc atg tgt ttc tct ata gac agt cca gac agt ctg 289 
Thr Asp Val He Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu 

80 85 90 

gag aac ata ccg gag aag tgg acg cct gag gtt cgt cac ttt tgt cca 337 
Glu Asn He Pro Glu Lys Trp Thr Pro Glu Val Arg His Phe Cys Pro 

95 100 105 • 

aat gtt cct ata ata ctt gtg ggt aac aaa aag gat ctt cgc aac gat 385 
Asn Val Pro He He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp 

110 115 120 

gaa agt acc aaa cgt gag etc atg aaa atg aaa cag gaa cca gtg aga 433 
Glu Ser Thr Lys Arg Glu Leu Met Lys Met Lys- Gin Glu Pro Val Arg 
125 130 135 140 

cca gag gat ggg cgc gec atg get gag aaa ate aac gee tac tct tat 481 
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Pro Glu Asp Gly Arg Ala Met Ala Glu Lys He Asn Ala Tyr Ser Tyr 

145 150 155 

ctt gag tgc tct get aaa acc aag gag ggc gtg agg gat gtg ttt gag 529 
Leu Glu Cys Ser Ala Lys Thr Lys Glu Gly Val Arg Asp Val Phe Glu 

160 165 170 

aca get acc aga get gcg ctg caa gtg aaa aag aag aag aag ggt gga 577 
Thr Ala Thr Arg Ala Ala Leu Gin Val Lys Lys Lys Lys Lys Gly Gly 

175 180 185 

tgt gtt gta ttg tgaataagtc gctgttttct tcaattcccc acaacagggc 629 
Cys Val Val Leu 
190 

tggtgaaagg aggactgeat gaaattttgt cttttgaaga tattcctttt aaaattattt 689 
tttttaaaat accaattttc aacttaagat tgttcaatta taattagtgc aaaaagaegg 749 
atgcttcccg actgaaacac aagaaaggee atatcagagt ggcatatgat cacagtcata 809 
ttttgtggct tgtgtgattg catatatatt taagaacaaa tataagegtc aactatctat 869 
ggctggagaa tgaggtaatg gaaatgtttt atttgagtaa attcctcatc tgcaaaaagt 929 
ttgtactaca gttgagtaaa atttcaagca ttgtctcctt tacaaattgt nngccccc 987 



<210> 142 
<211> 192 
<212> PRT 

<213> Aplysia calif ornica (California sea hare) 
<400> 142 

Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

15 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asd Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 . 70 75 80 

Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Arg His Phe Cys Pro Asn Val Pro He 

100 105 HO 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu Ser Thr Lys 

115 120 125 

Arg Glu Leu Met Lys Met Lys Gin Glu Pro Val Arg Pro Glu Asp Gly 

130 135 140 

Arq Ala Met Ala Glu Lys He Asn Ala Tyr Ser Tyr Leu Glu Cys Ser 
145 150 155 160 

Ala Lys Thr Lys Glu Gly Val Arg Asp Val Phe Glu Thr Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Val Lys Lys Lys Lys Lys Gly Gly Cys Val Val Leu 
180 185 190 

<210> 143 
<211> 579 
<212> DNA 

<213> Homo sapiens (man) 



<220> 



WO 2005/014828 



113/291 



PCT/EP2004/008136 



<221> CDS 

<222> (1) . . (579) 

<400> 143 

atg cag gcc ate aag tgt gtg gtg gtg gga gac gga get gta ggt aaa 48 

Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

act tgc eta ctg ate agt tac aca acc aat gca ttt cct gga gaa tat 96 

Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

ate cct act gtc ttt gac aat tat tct gcc aat gtt atg gta gat gga 144 
He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 . 45 

aaa ccg gtg aat ctg ggc tta tgg gat aca get gga caa gaa gat tat 192 
Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gac aga tta cgc ccc eta tec tat ccg caa aca gat gtg ttc tta att 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

tgc ttt tec ctt gtg agt cct gca tea ttt gaa aat gtc cgt gca aag 288 
Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

tgg tat cct gag gtg egg cac cac tgt ccc aac act ccc ate ate eta 336 
Trp Tyr Pro Glu Val Arg His His Cys Pro Asn Thr Pro He He Leu 

100 105 110 

gtg gga act aaa ctt gat ctt agg gat gat aaa gac acg ate gag aaa 384 
Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asp Thr He Glu Lys 

115 120 125 

ctg aag gag aag aag ctg act ccc ate acc tat ccg cag ggt eta gcc 432 
Leu Lys Glu Lys Lys Leu Thr Pro He Thr Tyr Pro Gin Gly Leu Ala 

130 135 140 

atg get aag gag att ggt get gta aaa tac ctg gag tgc teg gcg etc 480 
Met Ala Lys Glu He Gly Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 ^ 160 

aca cag cga ggc etc aag aca gtg ttt gac gaa gcg ate cga gca gtc 528 
Thr Gin Arg Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

etc tgc ccg cct ccc gtg aag aag agg aag aga aaa tgc ctg ctg ttg 576 
Leu Cys Pro Pro Pro Val Lys Lys Arg Lys Arg Lys Cys Leu Leu Leu 

180 185 190 

taa 579 



<210> 144 
<211> 192 
<212> PRT 

<213> Homo sapiens (man) 
<400> 144 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 * 40 45 

Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90- 95 

Trp Tyr Pro Glu Val Arg His His Cys Pro Asn Thr Pro He He Leu 

' 100 105 110 

Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asp Thr He Glu Lys 

115 120 1 125 

Leu Lys Glu Lys Lys Leu Thr Pro He Thr Tyr Pro Gin Gly Leu Ala 
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130 135 140 

Met Ala Lys Glu He Gly Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Arg Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

Leu Cys Pro Pro Pro Val Lys Lys Arg Lys Arg Lys Cys Leu Leu Leu 
180 185 190 

<210> 145 
<211> 642 
<212> DNA 

<213> Homo sapiens (man) 

<220> 

<221> CDS 

<222> (1) . . (642) 



240 



288 



<400> 145 

atg ccc gga gcc ggc cgc age age atg get cac ggg ccc ggc gcg ctg 48 

Met Pro Gly Ala Gly Arg Ser Ser Met Ala His Gly Pro Gly Ala Leu 

1 * 5 10 15 

atg etc aag tgc gtg gtg gtc ggc gac ggg gcg gtg ggc aag acg tgc 96 

Met Leu Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys Thr Cys 

20 25 30 

eta etc atg age tat gcc aac gac gcc ttc ccg gag gag tac gtg ccc 144 
Leu Leu Met Ser Tyr Ala Asn Asp Ala Phe Pro Glu Glu Tyr Val Pro 

35 40 45 

acc gtc ttc gac cac tac gca gtc age gtc acc gtg ggg ggc aag cag 192 
Thr Val Phe Asp His Tyr Ala Val Ser Val Thr Val Gly Gly Lys Gin 

50 • . 55 60 

tac etc eta gga etc tat gac acg gcc gga cag gaa gac tat gac cgt 
Tyr Leu Leu Gly Leu Tyr Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg 
65 70 * 75 80 

ctg agg cct tta tct tac cca atg acc gat gtc ttc ctt ata tgc ttc 
Leu Arg Pro Leu Ser Tyr Pro Met Thr Asp Val Phe Leu He Cys Phe 

85 90 95 

teg gtg gta aat cca gcc tea ttt caa aat gtg aaa gag gag tgg gta 336 
Ser Val Val Asn Pro Ala Ser Phe Gin Asn Val Lys Glu Glu Trp Val 

100 105 HO 

ccg gaa ctt aag gaa tac gca cca aat gta ccc ttt tta tta ata gga 
Pro Glu Leu Lys Glu Tyr Ala Pro Asn Val Pro Phe Leu Leu He Gly 

115 120 125 

act cag att gat etc cga gat gac ccc aaa act tta gca aga ctg aat 
Thr Gin He Asp Leu Arg Asp Asp Pro Lys Thr Leu Ala Arg Leu Asn 

130 " 135 140 

gat atg aaa gaa aaa cct ata tgt gtg gaa caa gga cag aaa eta gca 
Asp Met Lys Glu Lys Pro He Cys Val Glu Gin Gly Gin Lys Leu Ala 
145 ^ 150 155 160 

aaa gag ata gga gea tgc tgc tat gtg gaa tgt tea get tta acc cag 528 
Lys Glu He Gly Ala Cys Cys Tyr Val Glu Cys Ser Ala Leu Thr Gin 

165 170 175 

aag gga ttg aag act gtt ttt gat gag get ate ata gcc att tta act 
Lys Gly Leu Lys Thr Val Phe Asp Glu Ala He He Ala He Leu Thr 

180 185 190 

cca aag aaa cac act gta aaa aaa aga ata gga tea aga tgt ata aac 
Pro Lys Lys His Thr Val Lys Lys Arg He Gly Ser Arg Cys He Asn 

195 200 205 

tgt tgt tta att acg tga 
Cys Cys Leu He Thr 
210 



384 



432 



480 



576 



624 



642 



<210> 146 
<211> 213 
<212> PRT 

<213> Homo sapiens (man) 



<400> 146 



Met 


Pro 


Gly 


Ala 


1 ' 








Met 


Leu 


Lys 


Cys 








20 


Leu 


Leu 


Met 


Ser 










Thr 


Val 


Phe 


Asp 




50 






Tyr 


Leu 


Leu 


Gly 


65 








Leu 


Arg 


Pro 


Leu 


Ser 


Val 


Val 


As n 








100 


Pro 


Glu 


Leu 


Lys 






XJ.J 




Thr 


Gin 


He 


Asp 




13 0 






Asp 


Met 


Lys 


Glu 


145 








Lys 


Glu 


He 


Gly 


Lys 


Gly 


Leu 


Lys 








180 


Pro 


Lys 


Lys 


His 






195 




Cys 


Cys 


Leu 


He 




210 
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Gly Arg Ser Ser Met Ala His Gly Pro Gly Ala Leu 
5 10 15 

Val val Val Gly Asp Gly Ala Val Gly Lys Thr Cys 

25 30 
Tyr Ala Asn Asp Ala Phe Pro Glu Glu Tyr Val Pro 

40 45 
His Tyr Ala Val Ser Val Thr Val Gly Gly Lys Gin 

55 60 
Leu Tyr Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg 

70 75 80 

Ser Tyr Pro Met Thr Asp Val Phe Leu He Cys Phe 
85 90 95 

Pro Ala Ser Phe Gin Asn Val Lys Glu Glu Trp Val 

. 105 110 
Glu Tyr Ala Pro Asn Val Pro Phe Leu Leu He Gly 

120 125 
Leu Arg Asp Asp Pro Lys Thr Leu Ala Arg Leu Asn 

135 140- 
Lys Pro He Cys Val Glu Gin Gly Gin Lys Leu Ala 
150 155 160 

Ala Cys Cys Tyr Val Glu Cys Ser Ala Leu Thr Gin 
165 170 175 

Thr Val Phe Asp Glu Ala He He Ala He Leu Thr 

185 190 
Thr Val Lys Lys Arg He Gly Ser Arg Cys He Asn 
200 205 



<210> 147 
<211> 582 
<212> DNA 

<213> Homo sapiens (man) • 

<220> 

<221> CDS 

<222> (1) . . (582) 

<400> 147 

atg get gca ate cga aag aag ctg gtg ate gtt ggg gat ggt gec tgt 48 

Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 10 15 

ggg aag acc tgc etc etc ate gtc ttc age aag gat cag ttt ccg gag 96 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtc tac gtc cct act gtc ttt gag aac tat att gcg gac att gag gtg 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr He Ala Asp He Glu Val 

35 40 45 

gac ggc aag cag gtg gag ctg get ctg tgg gac aca gca ggg cag gaa 192 
Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gac tat gat cga ctg egg cct etc tec tac ccg gac act gat gtc ate 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

etc atg tgc ttc tec ate gac age cct gac age ctg gaa aac att cct 288 
Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gag aag tgg acc cca gag gtg aag cac ttc tgc ccc aac gtg ccc ate 336 
Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 110 

ate ctg gtg ggg aat aag aag gac ctg agg caa gac gag cac acc agg 384 
He Leu Val Gly Asn Lys Lys Asp Leu Arg Gin Asp Glu His Thr Arg 

115 120 ~ 125 

aga gag ctg gee aag atg aag cag gag ccc gtt egg tct gag gaa ggc 432 
Arg Glu Leu Ala Lys Met Lys Gin Glu Pro Val Arg Ser Glu Glu Gly 

130 135 140 

egg gac atg gcg aac egg ate agt gee ttt ggc tac ctt gag tgc tea 480 
Arg Asp Met Ala Asn Arg He Ser Ala Phe Gly Tyr Leu Glu Cys Ser 
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145 150 
gcc aag acc aag gag gga 
Ala Lys Thr Lys Glu Gly 
165 

get ggc etc cag gtc cgc 
Ala Gly Leu Gin Val Arg 
180 

etc . tga 
Leu 
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155 

gtg egg gag gtg ttt gag 
Val Arg Glu Val Phe Glu 
170 

aag aac aag cgt egg agg 
Lys Asn Lys Arg Arg Arg 
185 



PCT/EP2004/008136 
160 

atg gcc act egg 528 
Met Ala Thr Arg 
175 

ggc tgt ccc att 576 
Gly Cys Pro lie 
190 

582 



<210> 148 
<211> 193 
<212> PRT 

<213> Homo sapiens (man) 
<r400>'14-8 

Met Ala Ala lie Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

15 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr He Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 110 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Gin Asp Glu His Thr Arg 

115 120 125 

Arg Glu Leu Ala Lys Met Lys Gin Glu Pro Val Arg Ser Glu Glu Gly 

130 135 140 

Arg Asp Met Ala Asn Arg He Ser Ala Phe Gly Tyr Leu Glu Cys Ser 
145 150 155 160 

Ala Lys Thr Lys Glu Gly Val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 170 175 

Ala Gly Leu Gin Val Arg Lys Asn Lys Arg Arg Arg Gly Cys Pro He 
180 185 190 

Leu 



<210> 149 
<211> 576 
<212> DNA 

<213> Homo sapiens (man) 

<220> 

<221> CDS 

<222> (1) . - (576) 



<400> 149 



atg 


cag 


age 


ate 


aag 


tgc 


Met 


Gin 


Ser 


He 


Lys 


Cys 


1 








5 




acg 


tgc 


ctg 


etc 


ate 


tgc 


Thr 


Cys 


Leu 


Leu 


He 


Cys 








20 






ate 


ccc 


acc 


gtg 


ttc 


gac 


He 


Pro 


Thr 


Val 


Phe 


Asp 






35 








cgc 


aca 


gtg 


aac 


ctg 


aac 


Arg 


Thr 


Val 


Asn 


Leu 


Asn 




50 










gac 


cgc 


etc 


cgt 


aca 


etc 



gtg gtg gtg ggt gat ggg 
val Val val Gly Asp Gly 
10 

tac aca act aac get ttc 
Tyr Thr Thr Asn Ala Phe 
25 

aat tac age gcg cag age 
Asn Tyr Ser Ala Gin Ser 
40 

ctg tgg gac act gcg ggc 
Leu Trp Asp Thr Ala Gly 
55 60 
tec tac cct cag acc aac 



get gtg ggc aag 48 
Ala Val Gly Lys 
15 

ccc aaa gag tac 96 
Pro Lys Glu Tyr 
30 

gca gtt gac ggg 144 

Ala Val Asp Gly 

45 

cag gag gag tat 192 
Gin Glu Glu Tyr 

gtt ttc gtc ate 240 
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Asp 


Arg 


Leu 


Arg 


Thr 


Leu Ser Tyr Pro Gin Thr 


Asn Val Phe Val lie 




65 










70 




75 


80 




tgt 


ttc 


tec 


att 


gec 


agt 


ccg 


ccg tec tat gag 


aac gtg egg cac aag 


288 


Cys 


Phe 


Ser 


lie 


Ala 


Ser 


Pro Pro Ser Tyr Glu 


Asn Val Arg His Lys 












85 






90 


95 




tgg 


cat 


cca 


gag 


gtg 


tgc 


cac 


cac tgc cct gat 


gtg ccc ate ctg ctg 


336 


Trp 


His 


Pro 


Glu 


Val 


Cys 


His 


His Cys Pro Asp 


Val Pro lie Leu Leu 










100 








105 


110 




gtg 


ggc 


acc 


aag 


aag 


gac 


ctg 


aga gee cag cct 


gac acc eta egg cgc 


384 


Val 


Gly 


Thr 


Lys 


Lys 


Asp Leu Arg Ala Gin Pro 


Asp Thr Leu Arg Arg 








115 










120 


125 




etc 


aag 


gag 


cag 


age 


cag 


gcg 


ccc ate aca ccg 


cag cag ggc cag gca 


432 


Leu 


Lys 


Glu 


Gin 


Ser 


Gin 


Ala 


Pro lie Thr Pro 


Gin Gin Gly Gin Ala 






130 










135 




140 




etc 


gcg 


aaa 


cag 


ate 


cac 


get 


gtg cgc tac etc 


gaa tgc tea gee ctg 


480 


Leu 


Ala 


Lys 


Gin 


lie 


His 


Ala 


Val Arg Tyr Leu 


Glu Cys Ser Ala Leu 




145 










150 




155 


160 




caa 


cag 


gat 


ggt 


gtc 


aag 


gaa 


gtg ttc gee" gag 


get gtc egg get gtg 


528 


Gin 


Gin 


Asp 


Gly 


Val 


Lys 


Glu 


Val Phe Ala Glu 


Ala Val Arg Ala Val 












165 






170 


175 




etc 


aac 


ccc 


acg 


ccg 


ate 


aag 


cgt ggg egg tec 


tgc ate etc ttg 


573 


Leu 


Asn 


Pro 


Thr 


Pro 


lie 


Lys 


Arg Gly Arg Ser 


Cys He Leu Leu 










180 








. 185 


190 




tga 


















576 



<210> 150 
<211> 191 
<212> PRT 

<213> Homo sapiens (man) 
<400> 150 

Met Gin Ser He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu lie Cys Tyr Thr Thr Asn Ala Phe Pro Lys Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Gin Ser Ala Val Asp Gly 

35 40 45 

Arg Thr Val Asn Leu Asn Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr 

50 . 55 60 

Asp Arg Leu Arg Thr Leu Ser Tyr Pro Gin Thr Asn Val Phe Val He 
65 70 75 80 

Cys Phe Ser He Ala Ser Pro Pro Ser Tyr Glu Asn Val Arg His Lys 

85 90 95 

Trp His Pro Glu Val Cys His His Cys Pro Asp Val Pro He Leu Leu 

100 105 " 110 

Val Gly Thr Lys Lys Asp Leu Arg Ala Gin Pro Asp Thr Leu Arg Arg 

115 120 125 

Leu Lys Glu Gin Ser Gin Ala Pro He Thr Pro Gin Gin Gly Gin Ala 

130 135 140 

Leu Ala Lys Gin He His Ala Val Arg Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Gin Gin Asp Gly Val Lys Glu Val Phe Ala Glu Ala Val Arg Ala Val 

165 170 175 

Leu Asn Pro Thr Pro He Lys Arg Gly Arg Ser Cys He Leu Leu 
180 185 190 

<210> 151 

<211> 2292 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (1) . . (2292) 
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144 



240 



288 



480 



528 



<400> 151 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

gag ggc gag ggc gat gec acc tac ggc aag ctg acc ctg aag ttc ate 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 * 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

ctg ggc tac ggc ctg cag tgc ttc gec cgc tac ccc gac cac atg aag 
Leu Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 " 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Giu Gly Tyr Val Gin Glu 

85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 " * 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 ~ 135 140 

aac tac aac age cac aac gtc tat ate acc gee gac aag cag aag aac 
Asn Tyr Asn Ser His Asn Val Tyr lie Thr Ala Asp Lys Gin Lys Asn 
- 145 150 155 160 

ggc ate aag gec aac ttc aag ate cgc cac aac: ate gag gac ggc ggc 
Gly lie Lys Ala Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Gly 

165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

gtg acc gec gec etc gag aaa gag aaa gag egg cca gag att tct etc 
Val Thr Ala Ala Leu Glu Lys Glu Lys Glu Arg Pro Glu He Ser Leu 
225 230 235 240 

cct tea gat ttt gaa cac aca att cat gtc ggt ttt gat get gtc aca 
Pro Ser Asp Phe Glu His Thr He His Val Gly Phe Asp Ala Val Thr 

245 250 255 

ggg gag ttt acg gga atg cca gag cag tgg gec cgc ttg ctt cag aca 816 
Gly Glu Phe Thr Gly Met Pro Glu Gin Trp Ala Arg Leu Leu Gin Thr 

260 265 270' 

tea aat ate act aag teg gag cag aag aaa aac ccg cag get gtt ctg 864 
Ser Asn He Thr Lys Ser Glu Gin Lys Lys Asn Pro Gin Ala Val Leu 

275 280 285 

gat gtg ttg gag ttt tac aac teg aag aag aca tec aac age cag aaa 912 
Asp Val Leu Glu Phe Tyr Asn Ser Lys Lys Thr Ser Asn Ser Gin Lys 

290 295 300 

tac atg age ttt aca gat aag tea get tec gga ggt gga acc ggt ggt 
Tyr Met Ser Phe Thr Asp Lys Ser Ala Ser Gly Gly Gly Thr Gly Gly 
305 310 315 320 

gga ggt acc atg cag gee ate aag tgt gtg gtg gtg gga gac gga get 
Gly Gly Thr Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala 

325 330 335 

gta ggt aaa act tgc eta ctg ate agt tac aca' acc aat gca ttt cct 
Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro 

340 345 350 

gga gaa tat ate cct act gtc ttt gac aat tat tct gee aat gtt atg 1104 



672 



720 



768 



960 



1008 



1056 
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Gly Glu Tyr lie Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met 

355 360 365 

gta gat gga aaa ccg gtg aat ctg ggc tta tgg gat aca get gga caa 1152 
Val Asp Gly Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

370 375 " 380 

gaa gat tat gac aga tta cgc ccc eta tec tat ccg caa aca gat gtg 1200 
Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val 
385 390 395 400 

ttc tta att tgc ttt tec ctt gtg agt cct gca tea ttt gaa aat gtc 1248 
Phe Leu lie Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val 

405 410 415 

cgt gca aag tgg tat cct gag gtg egg cac cac tgt ccc aac act ccc 1296 
Arg Ala Lys Trp Tyr Pro Glu Val Arg His His Cys Pro Asn Thr Pro 

420 425 430 

ate ate eta gtg gga act aaa ctt gat ctt agg gat gat aaa gac acg 1344 
He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asp Thr 

435 440 445 

ate gag aaa ctg -aag gag aag aag ctg act ccc ate acc tat ccg cag 1392 
He Glu Lys Leu Lys Glu Lys Lys Leu Thr Pro He Thr Tyr Pro Gin 

450 455 460 

ggt eta gee atg get aag gag att ggt get gta aaa tac ctg gag tgc 1440 
Gly Leu Ala Met Ala Lys Glu lie Gly Ala Val Lys Tyr Leu Glu Cys 
465 470 475 • 480 

teg gcg etc aca cag cga ggc etc aag aca gtg ttt gac gaa gcg ate 1488 
Ser Ala Leu Thr Gin Arg Gly Leu Lys Thr Val Phe Asp Glu Ala He 

485 490 495 

cga gca gtc cgc ggc cgc atg gtg age aag ggc gag gag ctg ttc acc 1536 
Arg Ala Val Arg Gly Arg Met Val Ser Lys Gly Glu Glu Leu Phe Thr 

500 505 " * 510 

ggg gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc Gac 1584 
Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 

515 520 525 

aag ttc age gtg tec ggc gag ggc gag ggc gat gee acc tac ggc aag 1632 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

530 535 540 

ctg acc ctg aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg 1680 
Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
545 ~ 550 555 560 

ccc acc . etc gtg acc acc ctg acc tgg ggc gtg cag tgc ttc age cgc 1728 
Pro Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gin Cys Phe Ser Arg 

565 570 575 

tac ccc gac cac atg aag cag cac gac ttc ttc aag tec gee atg ccc 1776 
Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 

580 585 590 

gaa ggc tac gtc cag gag cgc acc ate ttc ttc aag gac gac ggc aac 1824 
Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 

595 600 605 

tac aag acc cgc gee gag gtg aag ttc gag ggc gac acc ctg gtg aac 1872 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 

610 615 620 

cgc ate gag ctg aag ggc ate gac ttc aag gag gac ggc aac ate ctg 1920 
Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
625 63 0 635 640 

ggg cac aag ctg gag tac aac tac ate age cac aac gtc tat ate acc 1968 
Gly His Lys Leu Glu Tyr Asn Tyr He Ser His Asn Val Tyr He Thr 

645 650 655 

gee gac aag cag aag aac ggc ate aag gee aac ttc aag ate cgc cac 2016 
Ala Asp Lys Gin Lys Asn Gly He Lys Ala Asn Phe Lys He Arg His 

660 665 670 

aac ate gag. gac ggc age gtg cag etc gee gac cac tac cag cag aac 2064 
Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 

675 680 685 

acc ccc ate ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ttg 2112 
Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 

690 695 700 

age acc cag tec gee ctg age aaa gac ccc aac gag aag cgc gat cac 2160 
Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
705 710 715 720 
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atg gtc ctg ctg gag ttc 
Met Val Leu Leu Glu Phe 
725 

gac gag ctg ggc cgc tct 
Asp Glu Leu Gly Arg Ser 
740 

aaa aag aag tea aag aca 
Lys Lys Lys Ser Lys Thr 
755 
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gtg acc gec gec ggg ate 
Val Tiir Ala Ala Gly He 
730 

aga aag atg age aaa gat 
Arg Lys Met Ser Lys Asp 
745 

aag tgt gta att atg taa 
Lys Cys Val He Met 
760 
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act etc ggc atg 2208 
Thr Leu Gly Met 
735 

ggt aaa aag aag 2256 
Gly Lys Lys Lys 
750 

2292 



<210> 152 
<211> 763 
<212> PRT 
<213> Homo sapiens 

Met°val 5 Lr Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Xle Leu 

- 5 10 15 

Val Glu Leu Asp Gly Asp val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 
Leu Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg -Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 • 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 " 135 I 40 

Asn Tyr Asn Ser His Asn val Tyr He Thr Ala Asp Lys Gin Lys Asn 
IAS 150 155 i6 ° 

Gly He Lys Ala Asn Phe Lys He Arg His Asn He Glu Asp Gly Gly 

165 I* 70 _ 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 I 85 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

val Thr Ala Ala Leu Glu Lys Glu Lys Glu Arg Pro Glu He Ser Leu 
oocr 230 235 

Pro Ser Asp Phe Glu His Thr He His Val Gly Phe Asp Ala Val Thr 

245 250 255 

Gly Glu Phe Thr Gly Met Pro Glu Gin Trp Ala Arg Leu Leu Gin Thr 

260 265 270 

Ser Asn He Thr Lys Ser Glu Gin Lys Lys Asn Pro Gin Ala Val Leu 

275 280 285 

Asp Val Leu Glu Phe Tyr Asn Ser Lys Lys Thr Ser Asn Ser Gin Lys 

290 295 300 

Tyr Met Ser Phe Thr Asp Lys Ser Ala Ser Gly Gly Gly Thr Gly Gly 

Gly Gly Thr Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala 

325 330 335 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro 



Glv Glu Tyr He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met 

355 360 365 

Val Asp Gly Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

370 375 380 

Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val 

Phe Leu He Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val 
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405 








410 






415 




Arg 


Ala 


Lys 


Trp 


Tyr 


Pro Glu 


Val 


Arg 


His His 


Cys 


Pro Asn 


Thr 


Pro 








420 








425 






430 






lie 


lie 


Leu 


Val 


Gly 


Thr Lys 


Leu 


Asp 


Leu Arg 


Asp 


Asp Lys 


Asp 


Thr 






435 








440 








445 






lie 


Glu 


Lys 


Leu 


Lys 


Glu Lys 


Lys 


Leu 


Thr Pro 


He 


Thr Tyr 


pro 


Gin 




450 






455 








460 






Gly 


Leu 


Ala 


Met 


Ala 


Lys Glu 


He 


Gly 


Ala Val 


Lys 


Tyr Leu 


Glu 


Cys 


465 










470 






475 








480 


Ser 


Ala 


Leu 


Thr 


Gin Arg Gly 


Leu 


Lys 


Thr' Val 


Phe 


Asp Glu 


Ala 


He 










485 








490 






495 




Arg 


Ala 


Val 


Arg 


Gly Arg Met 


Val 


Ser 


Lys Gly 


Glu 


Glu Leu 


Phe 


Thr 








500 








505 






510 






Gly 


Val 


Val 


Pro 


He 


Leu Val 


Glu 


Leu 


Asp Gly 


Asp 


Val Asn 


Gly 


His 






515 








520 








525 






Lys 


Phe 


Ser 


Val 


Ser Gly Glu 


Gly 


Glu 


Gly Asp 


Ala 


Thr Tyr 


Gly 


Lys 




530 








535 








540 






* 


Leu 


Thx 


Leu- Lys 


Phe 


He Cys 


Thr 


Thr 


Gly Lys 


Leu 


Pro Val 


Pro 


Trp 


545 










550 






555 








560 


Pro 


Thr 


Leu 


Val 


Thr 


Thr Leu 


Thr 


Trp 


Gly Val 


Gin 


Cys Phe 


Ser 


Arg 










565 








570 






575 




Tyr 


Pro 


Asp 


His 


Met 


Lys Gin 


His 


Asp 


Phe Phe 


Lys 


Ser Ala 


Met 


Pro 








580 








585 






590 






Glu 


Gly 


Tyr 


Val 


Gin 


Glu Arg 


Thr 


He 


Phe Phe 


Lys 


Asp Asp 


Gly 


Asn 






595 








600 








605 






Tyr 


Lys 


Thr 


Arg 


Ala 


Glu val 


Lys 


Phe 


Glu Gly 


Asp 


Thr Leu 


Val 


Asn 




610 








615 








620 








Arg 


lie 


Glu 


Leu 


Lys 


Gly He 


Asp 


Phe 


Lys Glu 


Asp 


Gly Asn 


He 


Leu 


625 










630 






635 








640 


Gly 


His 


Lys 


Leu 


Glu 


Tyr Asn 


Tyr 


lie 


Ser His 


Asn 


Val > Tyr 


lie 


Thr 










645 








650 






655 




Ala 


Asp 


Lys 


Gin 


Lys Asn Gly 


He 


Lys 


Ala Asn 


Phe 


Lys He 


Arg 


His 








660 








665 






670 






Asn 


lie 


Glu 


Asp 


Gly Ser Val 


Gin 


Leu 


Ala Asp 


His 


Tyr Gin 


Gin 


Asn 






675 






680 








685 






Thr 


Pro 


He 


Gly 


Asp Gly Pro 


Val 


Leu 


Leu Pro 


Asp 


Asn His 


Tyr 


Leu 




690 








695 








700 








Ser 


Thr 


Gin 


Ser 


Ala 


Leu Ser 


Lys 


Asp 


Pro Asn 


Glu 


Lys Arg 


Asp 


His 


705 










710 






715 








720 


Met 


Val 


Leu 


Leu 


Glu 


Phe Val 


Thr 


Ala 


Ala Gly 


He 


Thr Leu 


Gly 


Met 










725 








730 






735 




Asp 


Glu 


Leu 


Gly 


Arg 


Ser Arg 


Lys 


Met 


Ser Lys 


Asp 


Gly Lys 


Lys 


Lys 








74 0 








745 






750 






Lys 


Lys 


Lys 


Ser 


Lys 


Thr Lys 


Cys 


Val 


He Met 














755 








760 















<210> 153 

<211> 2289 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..{2289) 

<400> 153 



atg 


gtg 


age 


aag ggc gag gag 


ctg 


ttc 


acc 


ggg 


gtg 


gtg 


ccc 


ate 


ctg 


48 


Met 


Val 


Ser 


Lys Gly Glu Glu 


Leu 


Phe 


Thr Gly 


Val 


Val 


Pro 


He 


Leu 




1 






5 






10 










15 






gtc 


gag 


ctg 


gac ggc gac gta 


aac 


ggc 


cac 


aag 


ttc 


age 


gtg 


tec 


ggc 


96 


Val 


Glu 


Leu 


Asp Gly Asp Val 
20 


Asn 


Gly 
25 


His 


Lys 


Phe 


ser 


Val 
30 


Ser 


Gly 




gag 


ggc 


gag 


ggc gat gec acc 


tac 


ggc 


aag 


ctg 


acc 


ctg aag 


ttc 


ate 


144 


Glu Gly 


Glu 


Gly Asp Ala Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


lie 








35 




40 










45 










tgc 


acc 


acc 


ggc aag ctg ccc 


gtg 


ccc 


tgg 


ccc 


acc 


etc 


gtg 


acc 


acc 


192 


Cys 


Thr 
50 


Thr 


Gly Lys Leu Pro 
55 


Val 


Pro 


Trp 


Pro 


Thr 
60 


Leu 


Val 


Thr 


Thr 
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ctg ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240 

Leu Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag " 6 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 " 105 HO 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3»4 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 * ' 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 I 40 

aac tac aac age cac aac gtc tat ate acc gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Thr Ala Asp Lys Gin Lys Asn 
145 ' 150 I 55 160 

ggc ate aag gcc aac ttc aag ate ego cac aac ate gag gac ggc ggc 528 
Gly He Lys Ala Asn Phe Lys He Arg His Asn He Glu Asp Gly Gly 

165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc svb 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

ceo gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

195 200 205 

age aaa gac ceo aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 * 215 220 

gtg acc gcc gee etc gag aaa gag aaa gag egg cca gag att tct etc 720 
Val Thr Ala Ala Leu Glu Lys Glu Lys Glu Arg Pro Glu He Ser Leu 
225 230 235 240 

cct tea gat ttt gaa cac aca att cat gtc ggt ttt gat get gtc aca 768 
Pro Ser Asp Phe Glu His Thr He His Val Gly Phe Asp Ala Val Thr 

245 250 255 

ggg gag ttt acg gga atg cca gag cag tgg gcc cgc ttg ctt cag aca 816 
Gly Glu Phe Thr Gly Met Pro Glu Gin Trp Ala Arg Leu Leu Gin Thr 

260 265 270 

tea aat ate act aag teg gag cag aag aaa aac ccg cag get gtt ctg 864 
Ser Asn He Thr Lys Ser Glu Gin Lys Lys Asn Pro Gin Ala Val Leu 

275 280 285 

cat crtg ttg gag ttt tac aac teg aag aag. aca tee aac age eag aaa 912 
Isp Val Leu Glu Phe Tyr Asn Ser Lys Lys Thr Ser Asn Ser Gin Lys 

290 295 300 

tae atg age ttt aca gat aag tea get tee gga ggt gga acc ggt ggt 960 
Tyr Met Ser Phe Thr Asp Lys Ser Ala Ser Gly Gly Gly Thr Gly Gly 
305 310 315 320 

gga ggt acc cag aca att aag tgt gtt gtt gtg ggc gat ggt get gtt 1008 
Glv Gly Thr Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val 

325 330 335 

crgt aaa aca tgt etc ctg ata tec tac aca aca aac aaa ttt cca teg 1056 
Gly Lys Thr Cys Leu Leu He ser Tyr Thr Thr Asn Lys Phe Pro Ser 

340 345 350 

gaa tat gta ccg act gtt ttt gac aac tat gca gtc aca gtt atg att 1104 
Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He 

355 360 36S 

ggt gga gaa cca tat act ctt gga ctt ttt gat act gca ggg caa gag 1152 
Gly Gly Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu 

370 375 380 

gat tat gac aga tta cga ccg ctg agt tat cca caa aca gat gta ttt 1200 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe 
385 " 390 395 400 

eta gtc tgt ttt tea gtg gte tct cca tct tea ttt gaa aac gtg aaa 1248 
Leu Val Cys Phe Ser Val Val Ser Pro Ser Ser Phe Glu Asn Val Lys 

405 410 415 

aaa aag tgg gtg cct gag ata act cac cac tgt cca aag act cct ttc 1296 
Glu Lys Trp val Pro Glu He Thr His His Cys Pro Lys Thr Pro Phe 
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<210> 154 
<211> 762 
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<212> PRT 

<213> Homo sapiens 

<400> 154 m _ „ 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
! 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 " ~ 55 60 

Leu Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 " 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Ala Asn Phe Lys He Arg His Asn He Glu Asp Gly Gly 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

195 200 " 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Leu Glu Lys Glu Lys Glu Arg Pro Glu He Ser Leu 
225 230 235 240 

Pro Ser Asp Phe Glu His Thr He His Val Gly Phe Asp Ala Val Thr 

245 250 255 

Gly Glu Phe Thr Gly Met Pro Glu Gin Trp Ala .Arg Leu Leu Gin Thr 

260 265 270 

Ser Asn He Thr Lys Ser Glu Gin Lys Lys Asn Pro Gin Ala Val Leu 

275 280 285 

Asp Val Leu Glu Phe Tyr Asn Ser Lys Lys Thr Ser Asn Ser Gin Lys 

290 295 300 

Tyr Met Ser Phe Thr Asp Lys Ser Ala Ser Gly Gly Gly Thr Gly Gly 
305 310 315 320 

Glv Gly Thr Gin Thr lie Lys Cys Val Val Val Gly Asp Gly Ala Val 

325 330 335 

Glv Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser 

340 345 350 

Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He 

355 360 365 

Gly Gly Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu 

370 375 380 

Aso Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe 
385 390 395 400 

Leu Val Cys Phe Ser Val Val Ser Pro Ser Ser Phe Glu Asn Val Lys 

405 410 415 

Glu Lys Trp Val Pro Glu He Thr His His Cys Pro Lys Thr Pro Phe 

420 425 430 

Leu Leu Val Gly Thr Gin He Asp Leu Arg Asp Asp Pro Ser Thr He 
435 440 445 

• Glu Lys Leu Ala Lys Asn Lys Gin Lys Pro He Thr Pro Glu Thr Ala 
450 455 460 

Glu Lys Leu Ala Arg Asp Leu Lys Ala Val Lys Tyr Val Glu Cys Ser 
4 65 470 " 475 480 

Ala Leu Thr Gin Arg Gly Leu Lys Asn Val Phe Asp Glu Ala He Leu 

485 490 495 

Ala Ala Gly Gly Arg Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
500 ~ 505 510 
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<210> 155 
<211> 1139 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (66) . . (875) 

<400> 155 

tcgacccacg cgtccggtct agtcttttat attgtagata atatttctat ctggacataa 60 



caaca atg gca gat gca aac gaa gta gtg gaa tgg_gcc aac aat tta aag 110 
Met Ala Asp Ala Asn Glu Val Val Glu Trp Ala Asn Asn Leu Lys 
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Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro 

115 120 
ctg etc tgt ttc tec acc gtc act gec teg tea 
Leu Leu Cys Phe Ser Thr Val Tlir Ala Ser Ser 

130 135 
gaa aag tgg tac ccc gaa gtc aac cac- tac gtg 
Glu Lys Trp Tyr Pro Glu Val Asn His Tyr Val 

145 " 150 
att ttg gtc ggc acc aag etc gac ttg cgt gag 
He Leu Val Gly Thr Lys Leu Asp Leu Arg Glu 
160 165 170 

ccc tec act ggt gag tea gat ccc gtc acc ccc 
Pro Ser Thr Gly Glu Ser Asp Pro Val Thr Pro 

180 185 
atg cga aag caa ate aaa gcg ttg aag tat att 
Met Arg Lys Gin He Lys Ala Leu Lys Tyr He 
195 200 
" aca * cga aag 'aac ttg cag act ctg ttt tea gaa 
Thr Arg Lys Asn Leu Gin Thr Leu Phe Ser Glu 

210 215 
ttg get gaa cga cgc gca cac gag cct ccc aag 
Leu Ala Glu Arg Arg Ala His Glu Pro Pro Lys 

225 230 
teg acg aac ggc act aac "ggc act aag act gag 
Ser Thr Asn Gly Thr Asn Gly Thr Lys Thr Glu 
240 245 250 

gag ccc ate cga aag aag aag gag cgt cgc tgc 
Glu Pro He Arg Lys Lys Lys Glu Arg Arg Cys 

260 265 
gtgattggtc actctaacat cgactcgaga tctgeactat 
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140 
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Pro Asp 
175 
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Glu Glu 
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att gtc 
He Val 
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Ala Ser 

cgc gaa 
Arg Glu 
255 
taagagcect 



ttg ttg atg 
Leu Leu Met 



tactatttta tttgtcgttt 



ttaactttat ttaatcaaaa aaccctttca accaataagc cacgataaga gtcgtgcagt 



494 

542 

590 

638 

686 

734 

782 

830 

882 

942 

1002 



caaatttatt ttggtagccc ttggatctgc tatcaagatt ttcttgtatt atcgacttgt 
aattaatagg actgtagega tategtgett ctttatttat ttcaataaaa acaaaaaatt 



1062 



1122 



aaaaaaaaaa aaaaaaa 



1139 



<210> 156 
<211> 269 
<212> PRT 

<213> Arabidopsis thaliana 

Met°Ali 5 Asp Ala Asn Glu Val Val Glu Trp Ala Asn Asn Leu Lys Trp 

15 10 15 

Asp Lys Tyr Ala Gly Asn Lys Glu He Val Asp Gly Leu Ala Ala Leu 

20 25 30 

Arq Gin Lys He Thr Ala Leu Asp Gly Ala Glu Gly Gly Ser Gly Gly 

35 40 45 

Asn Gin Arg Val Lys Leu Val Val Val Gly Asp Gly Ala Val Gly Lys 

50 ~ 55 60 

Thr Ser Leu Leu He Ser Phe Ala Glu Asn Lys Phe Pro Glu Asp Tyr 
65 70 75 80 

Val Pro Thr Val Phe Glu Asn Tyr Thr Ser Lys He Thr Arg Asp Asp 

85 90 95 

Gly Thr Leu Val Leu Leu His Leu Trp Asp Thr Ala Gly Gin Glu Asp 

100 105 Hp 

Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gly Ala Asp Val He Leu 
115 120 125 
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ucu 


L / s 


Phe 


Ser 


Thr 


Val 


Thr 




130 










135 


JUjr o 


xxp 




Pro 


Glu 


Val 


Asn 


145 








150 




■UC u. 


V CLX 


vjxy 


Thr 


Lys 


Leu 


Asp 










165 






Ser 




Gly 


Glu 


Ser 


Asp 


Pro 








180 








Arg 


— 

jjys 




lie 


Lys 


Ala 


Leu 






-L J J 










Airy 


T.TTC! 
XJjr S 


Asn 


Leu 


Gin 


Thr 


Leu 




210 










215 


Aid 


uIU 


Arg 


Arg 


Ala 


His 


Glu 


225 










230 




Thr 


Asn 


Gly 


Thr 


Asn 


Gly 


Thr 










245 






Pro 


lie 


Arg 


Lys 


Lys 


Lys 


Glu 








260 
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Ala Ser Ser Phe Ala Ser lie Lys Glu 
140 

His Tyr Val Pro Asp Ala Pro Tyr He 
155 160 
Leu Arg Glu Ala Gly Leu Pro Asp Pro 

170 175 
Val Thr Pro Glu Lys Ala Glu Glu Met 

185 190 
Lys Tyr He Glu Val Ser Ala Lys Thr 
200 205 
Phe Ser Glu Ala Val Asp He Val Leu 
220 

Pro Pro Lys Pro Thr Ala Ala Ser Ser 
235 240 
Lys Thr Glu His Thr Pro Arg Glu Glu 

250 255 
Arg Arg Cys Leu Leu Met 
265 ~ 

<210> 157 
<211> 582 
<212> DNA 

<213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1) . . (582) 

<400> 157 

atg caa gca ate aaa tgt gtc gtc 'gtt ggt gac gga gec gtc ggt aag 48 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 •. 5 10 15 

acg tgt etc ctg eta teg tac act aca aac get ttt ccc gga gaa tat 96 

Thr Cys Leu Leu Leu Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

att eta acg gtg ttc gac acc tac tea aca aat gtg atg gtc gac gga 144 
lie Leu Thr Val Phe Asp Thr Tyr Ser Thr Asn Val Met Val Asp Gly 

35 40 45 

egg cca ata aat etc age eta tgg gac aca get gga cag gaa gat tac 192 
Arg Pro He Asn Leu Ser Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gat caa ttc cgc caa cat ttc cac aaa aaa ttt cca caa aca gac gta 240 
Asp Gin Phe Arg Gin His Phe His Lys Lys Phe Pro Gin Thr Asp Val 
65 70 75 80 

ttt etc gta tgc ttt gca ttg aat aat aat gtt cgt gca aaa tgg tat 288 
Phe Leu Val Cys Phe Ala Leu Asn Asn Asn Val Arg Ala Lys Trp Tyr 

85 90 95 

cca gaa gta tea cat cat tgc cca aat aca ccg att att ttg gtt gga 336 
Pro Glu Val Ser His His Cys Pro Asn Thr Pro He He Leu Val Gly 

100 105 110 

acg aaa get gat ttg cgc gag gat cga gat act gtt gaa egg etc cgc 384 
Thr Lys Ala Asp Leu Arg Glu Asp Arg Asp Thr Val Glu Arg Leu Arg 

115 120 125 

gaa age egg etc caa cca gtg age cac acc cag ggt tac gtg atg gca 432 
Glu Ser Arg Leu Gin Pro Val Ser His Thr Gin Gly Tyr Val Met Ala 

130 135 140 

aag gaa ate aag gcg gtc aag tac ctg gaa tgc teg gcg ctt acq caa 480 
Lys Glu He Lys Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu Thr Gin 
145 150 155 160 

att gga ttg aaa caa gtt ttc gat gag gca att cgt act ggg etc acc 528 
He Gly Leu Lys Gin Val Phe Asp Glu Ala He Arg Thr Gly Leu Thr 

165 170 175 

ccg cca caa aca cca caa acg aga gee aaa aag age aat tgc acg gtg 576 
Pro Pro Gin Thr Pro Gin Thr Arg Ala Lys Lys Ser Asn Cys Thr Val 

180 185 " 190 

ctt taa 582 
Leu 
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<210> 158 
<211> 193 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 158 

Met Gin Ala He Lys Cys Val Val Val Gly. Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu Leu Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Leu Thr Val Phe Asp Thr Tyr Ser Thr Asn Val Met Val Asp Gly 

35 40 45 

Arg Pro He Asn Leu Ser Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Gin Phe Arg Gin His Phe His Lys Lys Phe.. Pro. Gin. Thr Asp Val 
65 70 75 80 

Phe Leu Val Cys Phe Ala Leu Asn Asn Asn Val Arg Ala Lys Trp Tyr 

85 90 95 

Pro Glu Val Ser His His Cys Pro Asn Thr Pro He He Leu Val Gly 

100 105 HO 

Thr Lys Ala Asp Leu Arg Glu Asp Arg Asp Thr Val Glu Arg Leu Arg 

115 120 125 

Glu Ser Arg Leu Gin Pro Val Ser His Thr Gin Gly Tyr Val Met Ala 

130 135 140 

Lys Glu He Lys Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu Thr Gin 
145 150 ' 155 160 

He Gly Leu Lys Gin Val Phe Asp Glu Ala He Arg Thr Gly Leu Thr 

165 ^ 170 175 

Pro Pro Gin Thr Pro Gin Thr Arg Ala Lys Lys Ser Asn Cys Thr Val 
180 185 '. 190 

Leu 



<210> 159 
<211> 576 
<212> DNA 

<213> Ciona savignyi 

<220> 

<221> CDS 

<222> (1) . . (576) . 

<400> 159 

atg caa act ata aag tgt 
Met Gin Thr He Lys Cys 
1 5 
acc tgt ttg eta ata tec 
Thr Cys Leu Leu He Ser 
20 

gtt cca acg gtc ttc gac 
Val Pro Thr Val Phe Asp 
35 

gaa cca tac aca ttg ggt 
Glu Pro Tyr Thr Leu Gly 
50 

gac agg tta agg cct etc 
Asp Arg Leu Arg Pro Leu 
65 " 70 
tgt ttt tct gtt gta tea 
Cys Phe Ser Val Val Ser 
85 

tgg gtc ccg gaa ate acc 
Trp Val Pro Glu He Thr 
100 

gtt gga acc cag gtt gat 
Val Gly Thr Gin Val Asp 



gta gtt gtt gga gat ggt 
Val Val Val Gly Asp Gly 
10 

tat acc aca aac aag ttt 
Tyr Thr Thr Asn Lys Phe 
25 

aac tat gec gtc act gtt 
Asn Tyr Ala Val Thr Val 
40 

tta ttt gat act gca ggc 
Leu Phe Asp Thr Ala Gly 
55 60 
agt tat cca caa act gat 
Ser Tyr Pro Gin Thr Asp 
75 

ccg tea tea tat gaa aat 
Pro Ser Ser Tyr Glu Asn 
90 

cat cat tgt cca aag acg 
His His Cys Pro Lys Thr 
105 

ttg cga gat gat get gca 
Leu Arg Asp Asp Ala Ala 



get gtg ggt aaa 48 
Ala Val Gly Lys 
15 

cca caa gaa tat 96 
Pro Gin Glu Tyr 
30 

atg ate gga ggg 144 

Met He Gly Gly 

45 

cag gaa gat tat 192 
Gin Glu Asp Tyr 

gtg ttt ttg gtt 240 
Val Phe Leu Val 
80 

ata aaa gaa aag 288 
He Lys Glu Lys 
95 

cca ttt tta ttg 336 
Pro Phe Leu Leu 
110 

aca att gaa aaa 384 
Thr He Glu Lys 
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115 120 125 

aaa cag aaa get ata act cca gat ttg ggt gat aaa 432 
Lys Gin Lys Ala lie Thr Pro Asp Leu Gly Asp Lys 

135 140 
tta aaa gca gta aaa tat gtc gag tgc tct gec ctt 480 
Leu Lys Ala Val Lys- Tyr Val Glu Cys Ser Ala Leu 
150 155 . * 160 

tta aag aat gta ttt gat gaa get att tta get get 528 
Leu Lys Asn Val Phe Asp Glu Ala lie Leu Ala Ala 
165 170 175 

gag cct aaa aga cgt cgc cga tgt caa att ttg 573 



ctt 


tea 


aag 


aac 


Leu 


Ser 


Lys 


Asn 




130 






ctg 


gca 


aga 


gag 


Leu 


Ala 


Arg 


Glu 


145 








act 


caa 


aaa 


gga 


Thr 


Gin 


Lys 


Gly 


tta 


gag 


cca 


ccg 


Leu 


Glu 


Pro 


Pro 








180 


tga 









185 190 



576 



<210> 160 
<211> .191 
<212> PRT 

<213> Ciona savignyi 
<400> 160 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Gin Glu Tyr 

20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met lie Gly Gly 

35 • 40 45 

Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

Cys Phe Ser Val Val Ser Pro Ser Ser Tyr Glu Asn He Lys Glu Lys 

85 90 95 

Trp Val Pro Glu He Thr His His Cys Pro Lys Thr Pro Phe Leu Leu 

100 105 110 

Val Gly Thr Gin Val Asp Leu Arg Asp Asp Ala Ala Thr He Glu Lys 

115 120 125 

Leu Ser Lys Asn Lys Gin Lys Ala lie Thr Pro Asp Leu Gly Asp Lys 

130 " 135 140 

Leu Ala Arg Glu Leu Lys Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 150 *" 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Leu Ala Ala 

165 170 175 

Leu Glu Pro Pro Glu Pro Lys Arg Arg Arg Arg Cys Gin He Leu 
180 185 190 

<210> 161 
<211> 757 
<212> DNA 

<213> Cryptococcus neoformans 

<220> 

<221> CDS 

<222> (9) . . (602) 

<400> 161 

cacacacc atg tec gga gaa ate agg aga aag etc gtt att gtc ggc gac 50 
Met Ser Gly Glu He Arg Arg Lys Leu Val He Val Gly Asp 
1 5 10 

ggt get tgt ggt aag aca tgt; ctt ctt ate gtg ttc age aag ggc atg 98 
Gly Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Met 
15 20 25 " " 30 

ttc ccc gag gtg tac gtg ccc ace gtc ttt gaa aac tat gtc gee gat 146 
Phe Pro Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp 
35 40 45 
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gta gag gtg gac ggg aag aag gtc gag ctg gcg ttg tgg gat act get 194 
Val Glu Val Asp Gly Lys Lys Val Glu Leu Ala Leu Trp Asp Thr Ala 

50 55 60 

gga cag gag gac tat gac cga etc cga cct ctt tct tac ccc gac tec 242 
Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser 

65 70 75 

cat gtc att etc ate tgt ttc get att gac tec ccc gac teg ctt gac 290 
His Val He Leu He Cys Phe Ala He Asp Ser Pro Asp Ser Leu Asp 

80 85 90 

aat gtt caa gaa aag tgg att tec gag gtc ctt cac ttc tgt cag ggt 338 
Asn Val Gin Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly 
95 100 105 HO 

etc ccc ate gtc etc gtg gcg tgc aag aag gat etc cgc gac gat ggc 386 
Leu Pro He Val Leu Val Ala Cys Lys Lys Asp Leu Arg Asp Asp Gly 

115 120 125 

aag act ate cag gac ctt gee agg atg aac cag agg cct gtt tec egg 434 
Lys Thr He Gin Asp Leu Ala Arg Met Asn Gin Arg Pro Val Ser Arg 

130 " ' 135 140 

gec gag ggt atg gee gtt gee caa aag att ggg gcg cag ggc tac gtc 482 
Ala Glu Gly Met Ala Val Ala Gin Lys He Gly Ala Gin Gly Tyr Val 

145 150 155 

gag tgc agt gee aag acg ggc gag ggt gtc agg gaa gtg ttc cag act 530 
Glu Cys Ser Ala Lys Thr Gly Glu Gly Val Arg Glu Val Phe Gin Thr 

160 165 170 

get ace agg cat get etc cag age aaa aag tec aag tct ggc aga ggg • 578 
Ala Thr Arg His Ala Leu Gin Ser Lys Lys Ser Lys Ser Gly Arg Gly 
175 180 185 190 

aag aag ggt tgt gtg gtg ctt tagggctcta taaatcgagt tgtactttat 629 
Lys Lys Gly Cys Val Val Leu 
195 

cttgttcgtc getggegtet ttatacctct ttctcttcac gccattctca cgccttggcc 689 
caagtgtttg gggcttattc tctcttttac ttgctgtgat ttgatgaatc gatgagtagt 749 

757 

tat gee eg 



<210> 162 
<211> 197 
<212> PRT 

<213> Crypt ococcus neoformans 
<400> 162 

Met Ser Gly Glu He Arg Arg Lys Leu Val He Val Gly Asp Gly Ala 

1 5 10 15 

Cvs Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Met Phe Pro 

20 25 30 

Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp Val Glu 

35 40 45 

Val Asp Gly Lys Lys Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser His Val 
65 ^ 70 75 80 

He Leu He Cys Phe Ala He Asp Ser Pro Asp Ser Leu Asp Asn Val 

85 90 95 

Gin Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly Leu Pro 

100 105 HO 

He Val Leu Val Ala Cys Lys Lys Asp Leu Arg Asp Asp Gly Lys Thr 

115 120 125 

He Gin Asp Leu Ala Arg Met Asn Gin Arg Pro Val Ser Arg Ala Glu 

130 ~ 135 140 

Gly Met Ala Val Ala Gin Lys He Gly Ala Gin Gly Tyr Val Glu Cys 
145 150 155 160 

Ser Ala Lys Thr Gly Glu Gly Val Arg Glu Val Phe Gin Thr Ala Thr 
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165 170 175 

Arg His Ala Leu Gin Ser Lys Lys Ser Lys Ser Gly Arg Gly Lys Lys 

180 185 * 190 

Gly Cys Val Val Leu 
195 

<210> 163 
<211> 626 
<212> DNA 

<213> Gallus gallus 

<220> 

<221> CDS 

<222> (27) . . (605) 

<400> 163 

tctcggcggc gcgccatcgc gggcag atg cag gcg ate aag tgt gtg gtg gtg 53 

Met Gin Ala lie Lys Cys Val Val Val 



ggc 


gac 


gga 


get 


gta ggg 


aag 


1 
acc 


tgc 


ttg 


ctg 


5 
ate 


agt 


tac acc 


acg 


101 


Gly 


Asp 


Gly Ala 


Val Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr Thr 


Thr 




10 








15 










20 








25 




aat 


gec 


ttt 


cct 


gga gag 


tac 


ate 


ccc 


act 


gta 


ttt 


gat 


aac tat 


tct 


149 


Asn 


Ala 


Phe 


Pro 


Gly Glu 
30 


Tyr 


He 


Pro 


Thr 
35 


Val 


Phe 


Asp 


Asn Tyr 
40 


Ser 




gec 


aat 


gtc 


atg 


gta gat 


ggg 


aag 


cca 


gtg 


aat 


eta 


ggc 


etc tgg 


gat 


197 


Ala 


Asn 


val 


Met 


Val Asp 


Gly 


Lys 


Pro 


Val 


Asn 


Leu 


Gly 


Leu Trp Asp 










45 








50 










55 






aca 


gca 


gga 


caa 


gag gat 


tat 


gac 


cga 


ctg 


egg 


cct 


ctt 


tec tac 


cca 


245 


Thr 


Ala 


Gly Gin 


Glu Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser Tyr 


Pro 








60 








65 










70 








cag 


aca 


gat 


gtt 


ttc ttg 


att 


tgc 


ttc 


tec 


etc 


gtg 


agt 


cca gee 


•.tec 


293 


Gin 


Thr 


Asp Val 


Phe Leu 


He 


Cys 


Phe 


Ser 


Leu 


Val 


Ser 


Pro Ala 


Ser 






75 








80 










85 










ttt 


gag 


aat 


gtc 


aga gec 


aag 


tgg 


tac 


cct 


gag 


gtc 


cga 


cac cac 


tgc 


341 


Phe 


Glu 


Asn 


val 


Arg Ala 


Lys 


Trp 


Tyr 


Pro 


Glu 


Val 


Arg 


His His 


Cys 




90 








95 










100 








105 




cca 


aat 


aca 


cct 


ate ate 


ttg 


gtg 


ggc 


acc 


aag 


ctg 


gac 


tta agg 


gat 


389 


Pro 


Asn 


Thr 


Pro 


lie lie 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu Arg Asp 












110 








115 








120 






gat 


aag 


gac 


acc 


att gaa 


agg 


tta 


cgt 


gat 


aag 


aaa 


ctg 


get ccc 


ate 


437 


Asp 


Lys 


Asp 


Thr 
125 


lie Glu 


Arg 


Leu 


Arg 
130 


Asp 


Lys 


Lys 


Leu 


Ala Pro 
135 


He 




acc 


tac 


ccc 


caa 


ggt ttg 


gec 


atg 


get 


egg 


gag 


att 


ggc 


teg gta 


aag 


485 


Thr 


Tyr 


Pro 
140 


Gin 


Gly Leu 


Ala 


Met 
145 


Ala 


Arg 


Glu 


He 


Gly 
150 


Ser Val 


Lys 




tac 


ctt 


gag 


tgc 


tct gec 


ctg 


aca 


cag 


egg 


ggc 


ttg 


aag 


acg gtg 


ttt 


533 


Tyr 


Leu 
155 


Glu 


Cys 


Ser Ala 


Leu 
160 


Thr 


Gin 


Arg 


Gly 


Leu 
165 


Lys 


Thr Val 


Phe 




gat 


gaa 


gec 


ate 


egg get 


gtg 


etc 


tgc 


cca 


ccg 


ccc 


gtg 


aag aag 


cct 


581 


Asp 


Glu 


Ala 


lie 


Arg Ala 


Val 


Leu 


Cys 


Pro 


Pro 


Pro 


Val 


Lys Lys 


Pro 




170 








175 










180 








185 




ggc 


aaa 


aag 


tgc 


acc gtg 


ttc 


tgagggctgt ggcctaggtg ctgg 




626 


Gly 


Lys 


Lys 


Cys 


Thr Val 
190 


Phe 





















<210> 164 
<211> 192 
<212> PRT 

<213> Gallus gallus 
<400> 164 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 
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35 










4.0 








45 


TjVS 


Pro 


Val 


Asix 


Leu 


Gly 


Leu 


TrD 


map 


Thr 


Ala Gly 


Gin Glu Asp Tyr 


50 










55 








60 




Asp 


Arcr 


Leu 


Ara 


Pro 


Leu 


Ser 


Tvr 


Pro 


Gin 


Thr Asp 


Val Phe Leu lie 


65 








70 










75 


80 


Cys 


Phe 


Ser 


Leu 


Val 


Ser 


Pro 


Ala 


Ser 


Phe 


Glu Asn 


Val Arg Ala Lys 








85 










90 




95 


Tiro 


Tvr 


Pro 


Glu 


Val 


Arg 


His 


His 


W° 


Pro 


Asn Thr 


Pro lie lie Leu 




100 
















110 


Val 


GlV 


Thr 


Lvs 


Leu 


Asp 


Leu 






Asp 


Lys Asp 


Thr lie Glu Arg 




115 










ion 








125 






Asp 


Lys 


Lys 


Leu 


Ala 


Prn 
srJL w 


Tie 


Thr 


Tyr Pro 


Gin Gly Leu Ala 




13 0 










135 








140 




Met 


Ala 


Ara 


Glu 


lie 


Gly 


Ser 


Val 


Lys 


Tyr 


Leu Glu 


Cys Ser Ala Leu 


145 








150 










155 


160 


Thr 


Gin 


Arg 


Gly 


Leu 


Lys 


Thr 


Val 


Phe 


Asp 


Glu Ala 


lie Arg Ala Val 






165 










170 




175 


Leu 


Cys 


Pro 


Pro 


Pro 


Val 


Lys 


Lys 


Pro 


Gly 


Lys Lys 


Cys Thr Val Phe 






180 










185 






190 



200 



<210> 165 
<211> 731 
<212> DNA 

<213> Gallus gallus 

<220> 

<221> CDS 

<222> (48) . . (638) 

<400> 165 

cggccatcgt cctgctgcac ggcaaggcca gcttggcgag cctggcc atg gcc gcc 56 

Met Ala Ala 
1 

ate cgc aag aag ctg gtg gtg gtg gga gac ggc gcc tgt ggc aag acc 104 
lie Arg Lys Lys Leu Val Val Val Gly Asp Gly Ala Cys Gly Lys Thr 

5 ~ 10 15 

tgc etc etc ate gtc ttc age aag gac gag ttc ccc gag gtt tac gtg 152 
Cys Leu Leu He Val Phe Ser Lys Asp Glu Phe Pro Glu Val Tyr Val 
20 25 30 35 

ccc acc gtc ttt gag aac tac gtg gcc gac ate gag gtg gac ggc aag 
Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys 

40 45 50 

cag gtg gag ctg gcg ctg tgg gac acg gcc ggc cag gag gac tac gac 248 
Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp 

55 60 65 

cgc ctg cgc cct etc tec tac cca gac acg gac gtg ate etc atg tgc 296 
Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He Leu Met Cys 

70 75 80 

ttc tea gtg gac age ccg gac teg ctg gag aac ate ccg gag aag tgg 344 
Phe Ser Val Asp Ser Pro Asp Ser Leu Glu Asn He Pro Glu Lys Trp 

85 90 95 

gtg ccc gaa gtc aag cac ttc tgc ccc aac gtc ccc ate ate ctg gtg 392 
Val Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He He Leu Val 
100 105 HO 115 

gcc aac aag aaa gac ctg cgc aac gac gag cac gtg cgt aac gag ctg 440 
Ala Asn Lys Lys Asp Leu Arg Asn Asp Glu His Val Arg Asn Glu Leu 

120 125 130 

gcc cgc atg aag cag gag ccg gtg cgc act gag gat ggc cgc gcc atg 488 
Ala Arg Met Lys Gin Glu Pro Val Arg Thr Glu Asp Gly Arg Ala Met 

135 140 145 

gcc ate cgc ate cag gcc tac gac tac ctg gag tgc teg gcc aag acc 536 
Ala He Arg He Gin Ala Tyr Asp Tyr Leu Glu Cys Ser Ala Lys Thr 

150 155 160 

aag gag ggt gtg egg gag gtc ttt gag acg gcc acc egg gcg gcc ttg 584 
Lys Glu Gly Val Arg Glu Val Phe Glu Thr Ala Thr Arg Ala Ala Leu 

165 ~* 170 175 

cag aag cgc tac ggc act cag aac ggc tgc ate aat tgc tgc aag gtc 632 
Gin Lys Arg Tyr Gly Thr Gin Asn Gly Cys He Asn Cys Cys Lys Val 
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180 185 190 195 

eta tagggecegg ctggagccgg cgctgggcac ggctctgggt cacctgttgg 685 

L eu - 

caggeggaga ggagctgggg cacgcatgca cacagcatct gectgt 731 



<210> 166 
<211> 196 
<212> PRT 

<213> Gallus gallus 
<400> 166 



Met 


Ala Ala 


lie 


Arg 


Lys 


Lys 


Leu 


Val 


Val 


Val Gly 


Asp Gly Ala Cys 


1 


Lys Thr 




5 










10 




15 


Gly 


Cys 


Leu 


Leu 


He 


Val 


Phe 


Ser 


Lys Asp 


Glu Phe Pro Glu 






20 










25 






30 


Val 


Tyr Val 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 


Val Ala 


Asp He Glu Val 




35 










40 








45 


Asp 


Gly Lys 


Gin 


Val 


Glu 


Leu 


Ala 


Leu 


Trp 


Asp Thr 


Ala Gly Gin Glu 




50 








55 








60 




Asp 


Tyr Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro Asp 


Thr Asp Val He 


65 








70 










75 


80 


Leu 


Met Cys 


Phe 


Ser 


Val 


Asp 


Ser 


Pro 


Asp 


Ser Leu 


Glu Asn He Pro 








.85 










90 




95 


Glu 


Lys Trp 


Val 


Pro 


Glu 


Val 


Lys 


His 


Phe 


Cys Pro 


Asn Val Pro He 






100 










105 






110 


lie 


Leu Val 


Ala 


Asn 


Lys 


Lys 


Asp 


Leu 


Arg 


Asn Asp 


Glu His Val Arg 




115 










120 








125 


Asn 


Glu Leu 


Ala 


Arg 


Met 


Lys 


Gin 


Glu 


Pro 


Val Arg 


Thr Glu Asp Gly 




13 0 








135 








. 140 




Arg 


Ala Met 


Ala 


He 


Arg 


He 


Gin 


Ala 


Tyr 


Asp Tyr 


Leu Glu Cys Ser 


145 








150 










155 


160 


Ala 


Lys Thr 


Lys 


Glu 


Gly 


Val 


Arg 


Glu 


Val 


Phe Glu 


Thr Ala Thr Arg 








165 










170 




175 


Ala 


Ala Leu 


Gin 


Lys 


Arg 


Tyr 


Gly 


Thr 


Gin 


Asn Gly 


Cys He Asn Cys 






180 










185 






190 


Cys 


Lys Val 


Leu 



















195 

<210> 167 
<211> 883 
<212> DNA 

<213> Hemicentrotus pulcherrimus 

<220> 

<221> CDS 

<222> (61) . . (639) 

<400> 167 

cgggtgtaat ctgcctcttc ctggtactat tttatgaaat ttgtgtgaaa acctaagaca 60 



atg 


get 


get 


ata 


agg 


aaa 


aag 


ttg 


gtt 


ate 


gta 


gga 


gat 


ggt 


get 


tgt 


108 


Met 


Ala 


Ala 


He 


Arg 


Lys 


Lys 


Leu 


Val 


He 


Val 


Gly 


Asp 


Gly 


Ala 


Cys 




1 








5 










10 










15 






ggg 


aag 


acg 


tgt 


ctg 


etc 


ata 


gta 


ttt 


age 


aaa 


gac 


cag 


ttc 


cct 


gaa 


156 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Val 


Phe 


Ser 


Lys 


Asp 


Gin 


Phe 


Pro 


Glu 










20 










25 








30 








gtc 


tat 


gtc 


cca 


aca 


gtt 


ttt 


gag 


aac 


tat 


gta 


get 


gat 


ata 


gaa 


gta 


204 


val 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 


Val 


Ala 


Asp 


He 


Glu 


Val 








35 










40 










45 










gac 


ggt 


aaa 


cag 


gtt 


gag 


ttg 


gca 


tta 


tgg 


gat 


aca 


gca 


ggt 


cag 


gaa 


252 


Asp 


Gly 


Lys 


Gin 


Val 


Glu 


Leu 


Ala 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 






50 










55 










60 












gac 


tac 


gac 


aga 


ctg 


aga 


ccg 


etc 


tea 


tat 


cca 


gac 


aca 


gac 


gtt 


ata 


300 



tttcttt 



<210> 168 
<211> 192 
<212> PRT 

<213> Hemicentrotus pulcherrimus 



<400> 168 

Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

15 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asd Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

Leu Met Cys Phe Ser He Asp Asn Pro Asp Ser Leu Glu Thr He Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro Val 

100 105 HO 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Asp Ala Thr Lys 

115 120 125 

Arg Glu Leu Ser Lys Met Lys Gin Glu Pro Val Lys Tyr Asn Asp Ala 

130 135 140 

Gin Thr Met Ser Asp Lys He Asn Ala Tyr Lys Tyr Leu Glu Cys Ser 
145 150 155 160 

Ala Lys Ser Asn Asp Gly Val Arg Glu Val Phe Glu Thr Ala Thr Arg 
165 170 175 



348 



492 



540 
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Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 * 70 75 80 

etc atg tgc ttt tea ate gac aac cca gac agt tta gaa acc ate cca 
Leu Met Cys Phe Ser He Asp Asn Pro Asp Ser Leu Glu Thr He Pro 

85 90 95 

gaa aaa tgg aca cca gag gtg aag cac ttt tgc cct aat gta cct gtc 396 
Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro Val 

100 105 HO 

ate ttg gtc ggc aac aag aaa gat ctt cga aat gac gat gee aca aaa 444 
He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Asp Ala Thr Lys 

115 120 125 

egg gaa ctg agt aag atg aag cag gaa ccg gtg aaa tat aat gat gee 
Arg Glu Leu Ser Lys Met Lys Gin Glu Pro Val Lys Tyr Asn Asp Ala 

13 0 135 140 

cag acc atg tea gat aag ate aac gec tac aaa tac eta gaa tgc tea 
Gin Thr Met Ser Asp Lys He Asn Ala Tyr Lys Tyr Leu Glu Cys Ser 
145 150 155 160 

gee aag tct" aac gat ggt gtc agg gaa gtg* ttt gag aca gca acc aga 588 
Ala Lys Ser Asn Asp Gly Val Arg Glu Val Phe Glu Thr Ala Thr Arg 

165 170 175 

gca gca eta caa gtt aaa aag aag aag tea tea aaa tgc age etc ttg 636 
Ala Ala Leu Gin Val Lys Lys Lys Lys Ser Ser Lys Cys Ser Leu Leu 

180 185 190 

taagagaagt ettcatgeaa gctggcaagg catggtagtg gtcaacaaga ggtcctcaaa 696 

gaataatgea tttaatatac atgtatgtgt ttggggaaaa gagegattat gtaatatttc 756 

tggatgtgct ttgaaggtca aagacagegg tagtgecage atactttgtt taattgctgt 816 

tgtgataata agaggcaatc ggcttaattt tgtagttgca gatttacgaa accacttttc 876 



883 
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Ala Ala Leu Gin Val Lys Lys Lys Lys Ser Ser Lys Cys Ser Leu Leu 
180 185 190 

<210> 169 
<211> 958 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (146) . . (742) 

<400> 169 

gtcgacccac gcgtccggag agactttcag cagctatttc tttgtcttct ttctctataa 60 

agaacctcac tctctctctc tttagtcaaa aatccccaat tcgtgaacaa attagggttt 120 

ttgaagtttg aagaaggaga gaaag atg agt get tea agg ttt ate aag tgt 172 

Met Ser Ala Ser Arg Phe lie Lys Cys 
1 5 

gtc act gtc ggc gac ggt get gtt gga aag act tgt ctt etc ate tec 220 

Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu lie Ser 

10 *** 15 20 25 

tac act age aac act ttc ccc acg gat tat gtg cca act gtg ttc gat 268 

Tyr Thr Ser Asn Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp 

30 35 40 

aat ttc agt gec aat gtg att gtt gat ggc aac act ate aac ttg gga 316 
Asn Phe Ser Ala Asn Val lie Val Asp Gly Asn Thr lie Asn Leu Gly 

45 50 55 

ttg tgg gat act gca ggg caa gag gac tac aat aga eta aga cct ttg 364 
Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu 

60 65 70 

age tat cgc ggt gca gat gtc ttc tta ctt gca ttc tea ctt gtc age 412 
Ser Tyr Arg Gly Ala Asp Val Phe Leu Leu Ala Phe Ser Leu Val Ser 

75 B0 85 

aaa get age tat gaa aat gtt tct aaa aag tgg gtt cct gaa ctg aga 460 
Lys Ala Ser Tyr Glu Asn Val Ser Lys Lys Trp Val Pro Glu Leu Arg 
90 95 100 105 

cat tat get cct ggt gtt ccc ate ate etc gtt gga aca aag ctt gat 508 
His Tyr Ala Pro Gly Val Pro lie lie Leu Val Gly Thr Lys Leu Asp 

110 115 120 

ctt cga cat gat aag caa ttc ttt gee gag cac cct ggt get gtg cct 556 
Leu Arg His Asp Lys Gin Phe Phe Ala Glu His Pro Gly Ala Val Pro 

125 " 130 135 

ate tct ace get cag ggt gaa gaa eta aag aag ctg att ggg gcg cct 604 
lie Ser Thr Ala Gin Gly Glu Glu Leu Lys Lys Leu lie Gly Ala Pro 

140 145 150 

get tat ate gaa tgc agt gca aaa act caa cag aat gtg aaa gca gtg 652 
Ala Tyr lie Glu Cys Ser Ala Lys Thr Gin Gin Asn Val Lys Ala Val 

155 160 • 165 

ttt gat gcg get ate aag gtc gtt etc cag cca cca aaa aac aag aag 700 
Phe Asp Ala Ala lie Lys Val Val Leu Gin Pro Pro Lys Asn Lys Lys 
170 175 180 185 

aag aag aag aga aaa tct cag aaa ggt tgt tct ata etc tgattcaaca 749 
Lys Lys Lys Arg Lys Ser Gin Lys Gly Cys Ser He Leu 

190 195 
agaagaaagg aatcaacagg agaattagag ctataagtat ttgttttctc ctttttcata 809 

acttgtttac ccaaggaatc aaaaatctgt tgtttgtagt ctgaaaatga caaaagttga 869 

agaactagta acaagtattt gtattcttgt ttgcattttg atctgtaaat ataccacgaa 929 



tccttgattt ctctgtttcc taaaaaaaa 



958 
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<210> 170 
<211> 198 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 170 



Met 


Ser Ala 


Ser Arg 


Phe 


He Lys 


fVq Val Thr* Val 


Glv AST) Glv 


Ala 


1 




c 
b 








15 




vai 


Gly Lys 


Thr Cys 


Leu 


Leu He 


Oq-t r P\r w r m TViy* Gar 

Del iyi XJJU. OCX. 


Asn Thr Phe 


Pro 




20 






ZD 


30 




Thr 


Asp Tyr 
35 


vai Fro 


Thr 


Val Phe 
40 


7v a-»-» A an DVia Cot" 
ABp £\£}XX trXLC 0 Ci. 


Ala Asn Val 

X'XJLCL XMSAJk vaj. 

AC* 


He 


Val 


Asp Gly 


Asn Thr 


lie 


Asn Leu 


\j±y Leu x rp nsp 


XXIX vj -1- 


Gin 




50 






55 


O VJ 






Glu 


Asp Tyr 


Asn Arg 


Leu Arg Pro 


Leu Ser Tyr Arg 


Gly Ala Asp 


Val 


65 




70 








80 


Phe 


Leu Leu 


Ala Phe 
85 


Ser 


Leu Val 


Ser Lys Ala Ser 
90 


Tyr Glu Asn 
95 


Val 


Ser 


Lys Lys 


Trp Val 


Pro 


Glu Leu 


Arg His Tyr Ala 


Pro Gly Val 


Pro 




100 






105 


110 




lie 


lie Leu 


Val Gly 


Thr Lys Leu 


Asp Leu Arg His 


Asp Lys Gin 


Phe 




115 




120 




125 




Phe 


Ala Glu 


His Pro 


Gly Ala Val 


Pro He Ser Thr 


Ala Gin Gly 


Glu 




130 






135 


140 






Glu 


Leu Lys 


Lys Leu 


He 


Gly Ala 


Pro Ala Tyr He 


Glu Cys Ser 


Ala 


145 


150 




155 




160 


Lys 


Thr Gin 


Gin Asn 


Val 


Lys Ala 


Val Phe Asp Ala 


Ala He Lys 


Val 




165 






170 


175 




Val 


Leu Gin 


Pro Pro 


Lys 


Asn Lys 


Lys Lys Lys Lys 


Arg Lys Ser 


Gin 






180 




185 


190 




Lys 


Gly Cys 
195 


Ser lie 


Leu 











<210> 171 
<211> 780 
<212> DNA 

<213> Anopheles gambiae str 

<220> 

<221> CDS 

<222> (208).. (780) 

<400> 171 

gcggccgcaa aagcaaatga aaagttgaac tgaaacttcc aagcgcacaa agtcgagccg 



tgaggtgttt tccttgtttt tccgtgttcg agtgtgtgtg tggtggttct gtgtttcgta 



cgtgtgcggt gcgcgtttgt gttggaggtg tttcggtttg tattttcttc gtgtttccag 



tttgtttttc ctaggtttcg ctaaaag atg tea tec gga aga cct ate aaa tgt 

Met Ser Ser Gly Arg Pro He Lys Cys 
1 5 
gtg gtg gtc ggc gac ggc acg gtg ggg aag acg tgc atg ttg ate age 
Val Val Val Gly Asp Gly Thr Val Gly Lys Thr Cys Met Leu He Ser 
10 ~ 15 20 25 

tac acg acc gac age ttt ccc ggc gaa tac gta ccc acg gtc ttc gac 
Tyr Thr Thr Asp Ser Phe Pro Gly Glu Tyr Val Pro Thr Val Phe Asp 

30 35 40 

aac tac tec gec ccg atg gtg gtg gac ggt gtg caa gtg teg etc ggg 
Asn Tyr Ser Ala Pro Met Val Val Asp Gly Val Gin Val Ser Leu Gly 
45 50 55 
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ctg tgg gat acg gcc ggg cag gaa gac tac gat egg eta agg ccc ctg 426 
Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu 

60 65 70 

tec tac cca cag acg gac gtg ttc etc ata tgc tac agt gtg gcc age 474 
Ser Tyr Pro Gin Thr Asp Val Phe Leu He Cys Tyr Ser Val Ala Ser 

75 80 85 

ccg teg teg ttc gaa aac gtt acc tec aaa tgg tat ccc gag ate aag 522 
Pro Ser Ser Phe Glu Asn Val Thr Ser Lys Trp Tyr Pro Glu He Lys 
90 95 100 105 

cac cac tgc ccg gat gcg ccc ate att tta' gtc gga acc aaa ate gat 570 
His His Cys Pro Asp Ala Pro He He Leu Val Gly Thr Lys He Asp 

110 115 ~ 120 

ctg cgc gag gat egg gaa acg ata age ttg ctg gcg gac cag ggc ctt 618 
Leu Arg Glu Asp Arg Glu Thr He Ser Leu Leu Ala Asp Gin Gly Leu 

125 130 135 

tec gcg ctg aag cgc gaa cag ggc caa aag eta gcg aac aag ata egg 666 
Ser Ala Leu Lys Arg Glu Gin Gly Gin Lys Leu Ala Asn Lys lie Arg 

140 145 150" 

gcg gta aag tat atg gaa tgt teg gca eta acc cag egg ggc eta aag 714 
Ala Val Lys Tyr Met Glu Cys Ser Ala Leu Thr Gin Arg Gly Leu Lys 

155 160 ■ 165 

cag gtg ttt gac gaa gcg ctt tgc gcc acg gaa gag aaa gag cga atg 762 
Gin Val Phe Asp Glu Ala Leu Cys Ala Thr Glu Glu Lys Glu Arg Met 
170 175 180 185 

ccg gta gaa gag gag tag 780 
Pro Val Glu Glu Glu 
190 

<210> 172 
<211> 190 
<212> PRT 

<213> Anopheles garribiae str 
<400> 172 

Met Ser Ser Gly Arg Pro He Lys Cys Val Val Val Gly Asp Gly Thr 

1 5 10 15 

Val Gly Lys Thr Cys Met Leu lie Ser Tyr Thr Thr Asp Ser Phe Pro 

20 25 30 

Gly Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Pro Met Val 

35 40 45 

Val Asp Gly Val Gin Val Ser Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val 
65 70 75 80 

Phe Leu He Cys Tyr Ser Val Ala Ser Pro Ser Ser Phe Glu Asn Val 

85 90 95 

Thr Ser Lys Trp Tyr Pro Glu He Lys His His Cys Pro Asp Ala Pro 

100 105 110 

He He Leu Val Gly Thr Lys lie Asp Leu Arg Glu Asp Arg Glu Thr 

115 120 125 

He Ser Leu Leu Ala Asp Gin Gly Leu Ser Ala Leu Lys Arg Glu Gin 

130 135 140 

Gly Gin Lys Leu Ala Asn Lys He Arg Ala Val Lys Tyr Met Glu Cys 
145 150 . * 155 ~ 160 

Ser Ala Leu Thr Gin Arg Gly Leu Lys Gin Val Phe Asp Glu Ala Leu 

165 170 175 

Cys Ala Thr Glu Glu Lys Glu Arg Met Pro Val Glu Glu Glu 
180 185 190 

<210> 173 
<211> 657 
<212> DNA 

<2i3> Giardia larablia ATCC 50803 



<220> 

<221> CDS 

<222> (1) . . (657) 
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<400> 173 


atg 




age aca yy a dctu 




nix 


OCX XJJLL \JJ~y noli 


X 




c 
3 


ate 


aaa 


gec gca gcg get 


lie 


Lys 


Ala Val Vet J. Val 








ctt 


4-+- 4- 


gtc eat gcg aac 


T Ail 

Lieu 


irXie 


Val -tyr Ala -rvoii 








gtc 


ttt 


gac aac tat tct 


Val 


Ftie 


Asp Asn Tyr Ser 




50 




aat 


att 


ggt etc tgg gat 


Asn 


lie 


Gly Leu Trp Asp 


65 




70 


cga 


ccc 


ttg age tat cca 


Arg 


Pro 


Leu Ser Tyr Pro 






85 


gtc 


gtt 


tec age act teg 


Val 


Val 


Ser Ser Thr Ser 






100 


gag 


gtc 


aag gag tac tgt 


Glu 


Val 


Lys Glu Tyr Cys 






115 


aag 


tat 


gac ctg ctg tec 


Lys 


Tyr 


Asp Leu Leu Ser 




130 




aag 


aac 


caa tec ccg gtg 


Lys 


Asn 


Gin ser pro vax 


145 




150 


gag 


ata 


aag gec att aag 


Glu 


lie 


Lys Ala lie Lys 






165 


cgc 


gta 


aag gac gtg ttt 


Arg val 


Lys Asp Val Phe 






180 


atg 


ggg 


atg atg ggt agt 


Met 


Gly 


Met Met Gly Ser 






195 


gga 


tct 


gga. aag gga aag 


Gly Ser 


Gly Lys Gly Lys 




210 





gag gat aca gcg ggt gca 
Glu Asp Tnr Ala Gly Ala 
10 

gga gac ggc tea gtg ggc 
Gly Asp Gly Ser Val Gly 
25 

aac tea ttt cca gag gac 
Asn Ser Phe Pro Glu Asp 
40 

gee aac gtc gtt gtt gac 
Ala Asn Val Val Val Asp 
55 60 
act gcg ggc cag gag gat 
Thr Ala Gly Gin Glu Asp 
75 

ggg gcg cac gta ttt ttg 

Gly Ala His Val Phe Leu 
90 

ttt gca aac ate aga tec 
Phe Ala Asn He Arg Ser 
105 

cca aac gtg ccg atg ate 
Pro Asn Val Pro Met He 
120 

gac gag get tac ctg gee 
Asp Glu Ala Tyr Leu Ala 
135 140 
tct gat gaa cgt gca gag 
Ser Asp Glu Arg Ala Glu 
155 

tac att age tgc agt gee 
Tyr He Ser Cys Ser Ala 
170 

gat agt gca ate cgc gcg 
Asp Ser Ala He Arg Ala 
185 

ggg act tec aag gca gga 
Gly Thr Ser Lys Ala Gly 
200 

tgt gtt ata ttc tag 
Cys Val He Phe 
215 



agg atg ata cat 48 
Arg Met He His 
15 

aag ace tgc ctt 96 
Lys Thr Cys Leu 
30 

tac ttg cca aca 144 
Tyr Leu Pro Thr 

45 ' 

aac ttg acg ata 192 
Asn Leu Thr He 

tac gac aaa ttg 240 
Tyr Asp Lys Leu 
80 

"etc tgc ttt age 2 88 

Leu Cys Phe Ser 
95 

aaa tgg tac aca 336 
Lys Trp Tyr Thr 
110 

ctt gtc gga aca 384 

Leu Val Gly Thr 

125 

aag atg aaa gag 432 
Lys Met Lys Glu 

gaa gtt gca aaa 480 
Glu Val Ala Lys 
160 

cga tgc caa ctt 528 
Arg Cys Gin Leu 
175 

get etc aag aat 576 
Ala Leu Lys Asn 
190 

aag aaa aag gat 624 

Lys Lys Lys Asp 

205 

657 



<210> 174 
<211> 218 
<212> PRT 

<213> Giardia lamblia ATCC 50803 
<400> 174 



Met 


Thr 


Ser 


Thr 


Gly 


Asn 


Glu 


Asp 


Thr Ala Gly Ala Arg Met 


He 


His 


1 








5 










15 




He 


Lys 


Ala 


Val 


Val 


Val 


Gly 


Asp 


Gly Ser Val Gly Lys Thr 


Cys 


Leu 








20 










25 30 






Leu 


Phe 


Val 


Tyr 


Ala 


Asn 


Asn 


Ser 


Phe Pro Glu Asp Tyr Leu 


Pro 


Thr 






35 








40 


45 






Val 


Phe 


Asp 


Asn 


Tyr 


Ser 


Ala 


Asn 


Val Val Val Asp Asn Leu 


Thr 


He 




50 










55 




60 






Asn 


He 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly Gin Glu Asp Tyr Asp 


Lys 


Leu 


65 










70 






75 




80 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gly 


Ala 


His Val Phe Leu Leu Cys 


Phe 


Ser 








85 








90 


95 




Val 


Val 


Ser 


Ser 


Thr 


Ser 


Phe 


Ala 


Asn He Arg Ser Lys Trp 


Tyr 


Thr 








100 










105 110 






Glu 


Val 


Lys 


Glu 


Tyr 


Cys 


Pro 


Asn 


Val Pro Met He Leu Val 


Gly Thr 






115 










120 


125 
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Lys 


Tyr 


Asp Leu Leu Ser 




130 




Lys 


Asn 


Gin Ser Pro Val 


145 




150 


Glu 


lie 


Lys Ala lie Lys 






165 


Arg 


Val 


Lys Asp Val Phe 






180 


Met 


Gly 


Met Met Gly Ser 






195 


Gly 


Ser 


Gly Lys Gly Lys 




210 





<210> 175 
<211> 624 
<212> DNA 

<213> Neurospora crassa 

<220> 

<221> CDS 

<222> (1) ... (624) 



<400> 175 



aty 


gtg 


acg 


gga 


act 


• ate 




V cL 1 


X 1 1 1 






11C 


1 












aga 


gag 


tgc 


yua 


gtc 


gtc 


Arg 




Cys 


Val 


val 


V CLl 














etc 


ate 


age 


4- -J *- 

tat 


aca 


acg 


Leu 


lie 


OCX 


xyr 


xxxx 


xxxx 






35 








gtt 


ttc 


gac 


aac 


tat 


gee 


Val 


Phe 


Asp 


Asn 


Tyr 


Ala 




50 










acg 


etc 


ggc 


ctg 


ttc 


gat 


Thr 


Leu 


Gly 


Leu 


Phe 


Asp 


65 










70 


cgt 


ccg 


eta 


tea 


tac 


CCC 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 










85 




gtt 


gca 


teg 


ccc 


gee 


teg 


Val 


Ala 


Ser 


Pro 


Ala 


Ser 








100 






gaa 


gtc 


aac 


cat 


cac 


tgc 


Glu 


val 


Asn 


His 


His 


Cys 






115 








cag 


aag 


gat 


ttg 


cgc 


tec 


Gin 


Lys 


Asp 


Leu 


Arg 


Ser 




130 










cgc 


aag 


cag 


teg 


atg 


ata 


Arg 


Lys 


Gin 


Ser 


Met 


lie 


145 










150 


gat 


ctt 


gac 


gee 


gtc 


aaa 


Asp 


Leu 


Asp 


Ala 


val 


Lys 










165 




ggt 


etc 


aag 


aac 


gtg 


ttt 


Gly 


Leu 


Lys 


Asn 


Val 


Phe 








180 






cct 


cag 


aaa 


aag 


aca 


agt 


Pro 


Gin 


Lys 


Lys 


Thr 


Ser 



195 



tga 
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Asp 


Glu 


Ala 


Tyr 


Leu 


Ala 


135 










140 


Ser 


Asp 


Glu 


Arg 


Ala 


Glu 










155 




Tyr 


lie 


Ser 


Cys 


Ser 


Ala 








170 






Asp 


Ser 


Ala 


lie 


Arg 


Ala 






185 








Gly 


Thr 


Ser 


Lys 


Ala 


Gly 




200 










Cys 


Val 


He 


Phe 






215 












aag 


tat 


gee 


aaa 


aca 


aac 


Lys 


Tyr 


Ala 


Lys 


Thr 


Asn 








10 






ggt 


gac 


ggt 


gee 


gtt 


gga 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 






25 








aat 


aag 


ttc 


ccc 


teg 


gaa 


Asn 


Lys 


Phe 


Pro 


Ser 


Glu 




40 










gtc 


ace 


gtc 


atg 


ate 


ggt 


Val 


Thr 


Val 


Met 


He 


Gly 


55 










60 


aca 


gca 


gga 


caa 


gaa 


gat 


Thr 


Ala 


Glv 


Gin 


Glu 


Asp 










75 




cag 


ace 


gac 


gtc 


ttc 


etc 


Gin 


Thr 


Asp 


Val 


Phe 


Leu 








90 






ttc 


gaa 


aac 


gtg 


tec 


cag 


Phe 


Glu 


Asn 


Val 


Ser 


Gin 






105 








ccc 


ggc 


gtg 


ccc 


ttc 


etc 


Pro 


Gly 


Val 


Pro 


Phe 


Leu 




120 










gac 


aag 


gaa 


ctg 


agg 


gac 


Asp 


Lys 


Glu 


Leu 


Arg 


Asp 


135 










140 


gag 


ttc 


aag 


cag 


gga 


gag 


Glu 


Phe 


Lys 


Gin 


Gly 


Glu 










155 




tac 


gtc 


gag 


tgc 


agt 


gcg 


Tyr 


Val 


Glu 


Cys 


Ser 


Ala 








170 






gac 


gag 


gee 


ate 


gtt 


gcg 


Asp 


Glu 


Ala 


He 


Val 


Ala 






185 








aaa 


agg 


gac 


aag 


aag 


tgc 


Lys 


Arg 


Asp 


Lys 


Lys 


Cys 



200 
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Lys 


Met 


Lys 


Glu 




Glu 


Val 


Ala 


Lys 










160 




Arg 


Cys 


Gin 


Leu 








175 






Ala 


Leu 


Lys 


Asn 






190 








Lys 


Lys 


Lys 


Asp 




205 










cac 


ccc 


tta 


teg 


48 


His 


Pro 


Leu 


Ser 








15 






aag 


aca 


tgt 


etc 


96 


Lys 


Thr 


Cys 


Leu 






30 








tat 


gtg 


ccg 


aca 


144 


Tyr 


Val 


Pro 


Thr 




45 










gat 


gag 


ccc 


tat 


192 


Asp 


Glu 


Pro 


Tyr 




tac 


gac 


cgt 


tta 


240 


Tyr 


Asp 


Arg 


Leu 










80 




ate 


tgc 


ttc 


age 


288 


He 


Cys 


Phe 


Ser 






95 






aaa* 


tgg 


gee 


ccc 


336 


Lys 


Trp 


Ala 


Pro 






110 








ate 


gtc 


gga 


acg 


384 


He 


Val 


Gly 


Thr 




125 










aag 


ctt 


gee 


cag 


432 


Lys 


Leu 


Ala 


Gin 




aag 


ctt 


get 


cag 


480 


Lys 


Leu 


Ala 


Gin 










160 




ctg 


aca 


cag 


gaa 


528 


Leu 


Thr 


Gin 


Glu 








175 






gca 


ctg 


gag 


ccg 


576 


Ala 


Leu 


Glu 


Pro 






190 








ttg 


att 


ctg 




621 


Leu 


He 


Leu 







205 

624 



<210> 176 



WO 2005/014828 PCT/EP2004/008136 

140/291 

<211> 207 
<212> PRT 

<213> Neurospora crassa 
<400> 176 

Met Val Thr Gly Thr lie Lys Tyr Ala Lys Thr Asn His Pro Leu Ser 

1 5 10 15 

Arg Glu Cys Val Val Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu 

20 25 3 0 

Leu lie Ser Tyr Thr Thr Asn Lys Phe Pro' Ser Glu Tyr Val Pro Thr 

35 40 45 

Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Asp Glu Pro Tyr 

50 55 60 

Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu 
65 70 75 80 

Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He Cys Phe Ser 

85 90 95 

Val Ala Ser ^>ro Ala Ser Phe Glu Asn Val Ser Gin Lys Trp Ala Pro 

100 105 110 

Glu Val Asn His His Cys Pro Gly Val Pro Phe Leu He Val Gly Thr 

115 120 125 

Gin Lys Asp Leu Arg Ser Asp Lys Glu Leu Arg Asp Lys Leu Ala Gin 

130 135 140 

Arg Lys Gin Ser Met He Glu Phe Lys Gin Gly Glu Lys Leu Ala Gin 
145 150 155 160 

Asp Leu Asp Ala Val Lys Tyr Val Glu Cys Ser Ala Leu Thr Gin Glu 

165 170 175 

Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Ala Ala Leu Glu Pro 

180 185 190 

Pro Gin Lys Lys Thr Ser Lys Arg Asp Lys Lys Cys Leu He Leu 
195 200 205 

<210> 177 
<211> 603 
<212> DNA 

<213> Neurospora crassa 

<220> 

<221> CDS 

<222> (1) . . (603) 

<400> 177 

atg gca tea ggc age cct cag aat gtc ate cgc agg aaa etc gtc att 48 
Met Ala Ser Gly Ser Pro Gin Asn Val He Arg Arg Lys Leu Val He 
15 10 15 

ate ggc gac ggt get tgc gga aaa acc agt eta ctg age gtc ttc act 96 
He Gly Asp Gly Ala Cys Gly Lys Thr Ser Leu Leu Ser Val Phe Thr 

20 * 25 30 

ttg ggt ttt ttc cca gec aca tat ate ccc acc gtg ttc gag aat tac 144 
Leu Gly Phe Phe Pro Ala Thr Tyr He Pro Thr Val Phe Glu Asn Tyr 

35 40 45 

gtt acc gac tgc cga gta gat ggc aag teg gtt caa eta gcg eta tgg 192 
Val Thr Asp Cys Arg Val Asp Gly Lys Ser Val Gin Leu Ala Leu Trp 

50 55 60 

gac aca gee gga caa gaa gac tat gag cga tta cga cca eta gca tac 240 
Asp Thr Ala Gly Gin Glu Asp Tyr Glu Arg Leu Arg Pro Leu Ala Tyr 
65 ' 70 75 80 

tea aag gcg cat gtc att eta ata ggg ttt tct gtc gac acg cca gat 288 
Ser Lys Ala His Val He Leu He Gly Phe Ser Val Asp Thr Pro Asp 

85 90 95 

tct ttg gac aac gtt aaa cac aag tgg gtt act gag gee aat gaa aga 336 
Ser Leu Asp Asn Val Lys His Lys Trp Val Thr Glu Ala Asn Glu Arg 

100 105 HO 

tgt ccc aat gtt ccc ate att ttg gtg ggt etc aag aag gat etc agg 384 
Cys Pro Asn Val Pro He He Leu Val Gly Leu Lys Lys Asp Leu Arg 

115 120 125 

gga gat cct gtt gee ate gaa gaa atg cga aag cga tec caa cga ttc 432 
Gly Asp Pro Val Ala He Glu Glu Met Arg Lys Arg Ser Gin Arg Phe 
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130 135 140 

gtg atg gag gac gaa ggc cag agg ata gca aag gag att ggc get cga 480 
Val Met Glu Asp Glu Gly Gin Arg lie Ala Lys Glu lie Gly Ala Arg 
145 150 155 160 

aag tat etc gaa tgt tec agt ctt acc gga gaa ggt gtt gac gac gta 528 
Lys Tyr Leu Glu Cys Ser Ser Leu Thr Gly Glu Gly Val Asp Asp Val 

165 170 175 

ttc gag gca gec acg cga gcg gcg ctg eta acg ttt gag aag aag gaa 576 
Phe Glu Ala Ala Tlir Arg Ala Ala Leu Leu Thr Phe Glu Lys Lys Glu 

180 " 185 190 

gga age ggg tgc tgt gtg att eta tga 603 
Gly Ser Gly Cys Cys Val lie Leu 
195 200 

<210> 178 
<211> 200 

<212> PRT ... 
<213> Neurospora crassa 

<400> 178 

Met Ala Ser Gly Ser Pro Gin Asn Val lie Arg Arg Lys Leu Val lie 

1 5 10 15 

lie Gly Asp Gly Ala Cys Gly Lys Thr Ser Leu Leu Ser Val Phe Thr 

20 25 30 

Leu Gly Phe Phe Pro Ala Thr Tyr He Pro Thr Val Phe Glu Asn Tyr 

35 40 45 

Val Thr Asp Cys Arg Val Asp Gly Lys Ser Val Gin Leu Ala Leu Trp 

50 55 60 

Asp Thr Ala Gly Gin Glu Asp Tyr Glu Arg > Leu Arg Pro Leu Ala Tyr 
65 70 75 80 

Ser Lys Ala His Val He Leu He Gly Phe Ser Val Asp Thr Pro Asp 

85 90 ■ 95 ~ .■ 

Ser Leu Asp Asn Val Lys His Lys Trp Val Thr Glu Ala Asn Glu Arg 

100 105 110 

Cys Pro Asn Val Pro He He Leu Val Gly Leu Lys Lys Asp Leu Arg 

115 120 125 

Gly Asp Pro Val Ala He Glu Glu Met Arg Lys Arg Ser Gin Arg Phe 

130 135 140 

Val Met Glu Asp Glu Gly Gin Arg He Ala Lys Glu He Gly Ala Arg 
145 150 . 155 160 

Lys Tyr Leu Glu Cys Ser Ser Leu Thr Gly Glu Gly Val Asp Asp Val 

165 170 175 

Phe Glu Ala Ala Thr Arg Ala Ala Leu Leu Thr Phe Glu Lys Lys Glu 

180 185 190 

Gly Ser Gly Cys Cys Val He Leu 
195 200 

<210> 179 
<211> 624 
<212> DNA 

<213> Neurospora crassa 

<220> 

<221> CDS 

<222> (1) . . (624) 

<400> 179 

atg cct tgc gga etc gga ggg teg aaa aca gtg cag cgc aag etc gtc 48 

Met Pro Cys Gly Leu Gly Gly Ser Lys Thr Val Gin Arg Lys Leu Val 

1 5 10 15 

ttg etc ggc gat ggt gee tgc gga aaa acc tea ctg etc aac gtt ttc 96 

Leu Leu Gly Asp Gly Ala Cys; Gly Lys Thr Ser Leu Leu Asn Val Phe 

20 25 30 

aca aga ggc tat ttc ccc act gta tac gag ccg aca gtc ttc gag aac 144 
Thr Arg Gly Tyr Phe Pro Thr Val Tyr Glu Pro Thr Val Phe Glu Asn 

35 40 45 

tat gtg cac gac att ttc gtc gac aat gtc cat att gaa ctg tea ttg 192 
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Tyr 


Val 


His 


Asp 


He 


Phe 




50 










tgg 


gat 


ace 


gee 


ggc 


cag 


Trp 


Asp 


Thr 


Ala 


Gly Gin 


65 










70 


tac 


gat 


gae 


ace 


gac 


tta 


Tyr 


Asp 


Asp 


Thr 


Asp 


Leu 










85 




gac 


teg 


etc 


gaa 


aac 


gtc 


Asp 


Ser 


Leu 


Glu 


Asn 


Val 






100 






aac 


tgc 


ccg 


ggc 


gtc 


aag 


Asn 


Cys 


Pro Gly 


Val 


Lys 






115 








cgc 


gag 


gcg 


acc 


gag 


gac 


Arg 


Glu 


Ala 


Thr 


Glu Asp 




130 










get 


caa 


cgt 


gag 


aag aag 


Ala 


Gin 


Arg Glu 


Lys 


Lys 


145 










150 


gtg 


get 


cgc 


egg 


ate 


aag 


val 


Ala 


Arg Arg 


He 


Lys 










165 




cga 


aac 


egg 


gga 


gtc 


aac 


Arg 


Asn 


Arg 


Gly 


Val 


Asn 








180 






ttg 


tea 


gtg 


aag 


aag 


gac 


Leu 


Ser 


val 


Lys 


Lys 


Asp 






195 









taa 
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Val Asp- Asn Val His He 
55 * 60 
gaa gaa ttt gac agg eta 
Glu Glu Phe Asp Arg Leu 
75 

ate atg ctt- tgc tac tet 
He Met Leu Cys Tyr .Ser 
90 

gaa age aaa tgg caa ggc 
Glu Ser Lys Trp' Gin Gly 
105 

att gtc ttg gtc gcg ttg 
He Val Leu Val Ala Leu 
120 

gaa gag gat ggc gga gec 
Glu Glu Asp Gly Gly Ala 
135 140 
cca atg ate age tac gac 
Pro Met He Ser Tyr Asp 
155 

gec ctt cga tat etc gaa 
Ala Leu Arg Tyr Leu Glu 
170 

gaa gec ttc acc gaa gca 
Glu Ala Phe Thr Glu Ala 
185 

egg gag gag teg aag tgc 
Arg Glu Glu Ser Lys Cys 
200 
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Glu Leu Ser Leu 

cga agt eta tea 240 
Arg Ser Leu Ser 
80 

gtc gac age aag 288 
Val Asp Ser Lys 
95 

gag att get gaa 336 
Glu He Ala Glu 
110 

aag tgc gac ctg 384 

Lys Cys Asp Leu 

125 

aac gag gac ggt 432 
Asn Glu Asp Gly 

cag ggc ctg gaa 480 
Gin Gly Leu Glu 
160 

tgc tec gee atg 528 
Cys Ser Ala Met 
175 

gcg cgt gtc get 576 
Ala Arg Val Ala 
190 

gtg gtc atg 621 

Val Val Met 

205 

624 



<210> 180 
<211> 207 
<212> PRT 

<213> Neurospora crassa 
<400> 180 

Met Pro Cys Gly Leu Gly Gly Ser Lys Thr Val Gin Arg Lys Leu Val 

1 5 " 10 15 

Leu Leu Gly Asp Gly Ala Cys Gly Lys Thr Ser Leu Leu Asn Val Phe 

20 25 30 

Thr Arg Gly Tyr Phe Pro Thr Val Tyr Glu Pro Thr Val Phe Glu Asn 

35 * 40 45 

Tyr val His Asp He Phe Val Asp Asn Val His He Glu Leu Ser Leu 

50 55 60 

Trp Asp Thr Ala Gly Gin Glu Glu Phe Asp Arg Leu Arg Ser Leu Ser 
65 70 75 80 

Tyr Asp Asp Thr Asp Leu He Met Leu Cys Tyr Ser Val Asp Ser Lys 

85 90 95 

Asp Ser Leu Glu Asn Val Glu Ser Lys Trp Gin Gly Glu He Ala Glu 

100 105 110 

Asn Cys Pro Gly Val Lys He Val Leu Val Ala Leu Lys Cys Asp Leu 

115 120 125 

Arg Glu Ala Thr Glu Asp Glu Glu Asp Gly Gly Ala Asn Glu Asp Gly 

130 135 140 

Ala Gin Arg Glu Lys Lys Pro Met He Ser Tyr Asp Gin Gly Leu Glu 
145 150 155 160 

Val Ala Arg Arg He Lys Ala Leu Arg Tyr Leu Glu Cys Ser Ala Met 

165 170 175 

Arg Asn Arg Gly Val Asn Glu Ala Phe Thr Glu Ala Ala Arg Val Ala 

180 185 190 

Leu Ser Val Lys Lys Asp Arg Glu Glu Ser Lys Cys Val Val Met 
195 200 205 



<210> 181 
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<211> 645 
<212> DNA 

<213> Gallus gallus 
<220> 

<221> CDS - 
<222> (1) . . (645) 

<400> 181 

atg ccc ggg gga ggg egg get ccg gec atg gec cat ggc age gee gcg 48 
Met Pro Gly Gly Gly Arg Ala Pro Ala Met Ala His Gly Ser Ala Ala 
1-5 10 15 

gtc atg ctg aag tgc gtg gtg gtg ggg gac ggc gec gtg ggc aag acg 96 
Val Met Leu Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys Thr 

20 25 30 

tgc ctg ctg atg age tac gec aac gac gee ttc cct gag gag tac gtg 144 
Cys Leu lieu Met Ser Tyr Ala Asn Asp Ala Phe Pro Glu Glu Tyr Val 

35 40 45 

ccc ace gtc ttc gac cac tac gca gtc age gtt acc gtg gag ggc aag 192 
Pro Thr Val Phe Asp His Tyr Ala Val Ser Val Thr Val Glu Gly Lys 

50 55 60 

cag tac ctg ctg ggg etc tac gac acc gee ggg cag gaa gac tat gac 240 
Gin Tyr Leu Leu Gly Leu Tyr Asp Thr Ala Gly Gin Glu Asp Tyr Asp 
65 70 75 80 

cgt ctg aga cct tta tct tat cct atg acc gat gtc ttc ctt ate tgc 288 
Arg Leu Arg Pro Leu Ser Tyr Pro Met Thr Asp Val Phe Leu lie Cys 

85 90 95 

ttc tea gtg gta aac cct get tea ttt caa aac gtg aag gaa gaa tgg 336 
Phe Ser Val Val Asn Pro Ala Ser Phe Gin Asn Val Lys Glu Glu Trp 

100 105 110 

gta ccg gag ttg aag gaa tat gca cct aat gtt cct ttt tta eta gta 384 
Val Pro Glu Leu Lys Glu Tyr Ala Pro. Asn Val Pro Phe Leu Leu Val 

115 120 125 

gga aca cag att gat ctt cgt gat gac ccc aaa act ctg gca aga ttg 432 
Gly Thr Gin lie Asp Leu Arg Asp Asp Pro Lys Thr Leu Ala Arg Leu 

130 135 " 140 

aat gat atg aaa gag aag cct tta tct gtg gaa caa gga cag aaa tta 480 
Asn Asp Met Lys Glu Lys Pro Leu Ser Val Glu Gin Gly Gin Lys Leu 
145 150 155 160 

gca aaa gag ata gga gee tac tgc tat gtg gag tgt tea get tta aca 52 8 

Ala Lys Glu lie Gly Ala Tyr Cys Tyr Val Glu Cys Ser Ala Leu Thr 

165 170 175 

cag aaa gga ctg aag act gtt ttt gat gaa get att ata gee att eta 576 
Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala lie lie Ala He Leu 

180 185 190 

act cca aag aaa cac acg gtg aag aag aga ata ggt teg aga tgc ata 624 
Thr Pro Lys Lys His Thr Val Lys Lys Arg He Gly Ser Arg Cys He 

195 200 " 205 

aac tgc tgt ttg ate acg tga 645 
Asn Cys Cys Leu He Thr 
210 



<210> 182 
<211> 214 
<212> PRT 

<213> Gallus gallus 
<400> 182 

Met Pro Gly Gly Gly Arg Ala Pro 
1 5 
Val Met Leu Lys Cys Val Val Val 
20 

Cys Leu Leu Met Ser Tyr Ala Asn 

35 40 
Pro Thr Val Phe Asp His Tyr Ala 

50 55 
Gin Tyr Leu Leu Gly Leu Tyr Asp 



Ala Met Ala His Gly Ser Ala Ala 

10 15 
Gly Asp Gly Ala Val Gly Lys Thr 
25 30 
Asp Ala Phe Pro Glu Glu Tyr Val 
45 

Val Ser Val Thr Val Glu Gly Lys 
60 

Thr Ala Gly Gin Glu Asp Tyr Asp 
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65 70 75 80 

Arg Leu Arg Pro. Leu Ser Tyr Pro Met Thr Asp Val Phe Leu lie Cys 

85 * 90 95 

Phe Ser Val Val Asn Pro Ala Ser Phe Gin Asn Val Lys Glu Glu Trp 

100 105 HO 

Val Pro Glu Leu Lys Glu Tyr Ala Pro Asn Val Pro Phe Leu Leu Val 

115 120 125 

Gly Thr Gin lie Asp Leu Arg Asp Asp Pro Lys Thr Leu Ala Arg Leu 

130 135 ~ 140 

Asn Asp Met Lys Glu Lys Pro Leu Ser Val Glu Gin Gly Gin Lys Leu 
145 * 150 155 160 

Ala Lys Glu lie Gly Ala Tyr Cys Tyr Val Glu Cys Ser Ala Leu Thr 

165 170 175 

Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala lie lie Ala He Leu 

180 185 190 

Thr Pro Lys Lys His Thr Val Lys Lys Arg He Gly Ser Arg Cys He 

195 200 205 

Asn Cys Cys Leu" lie Thr 
210 

<210> 183 
<211> 582 
<212> DNA 

<213> Brachydanio rerio 

<220> 

<221> CDS 

<222> (1) (582) 

<400> 183 

atg get gca att cga aag aaa ctt gtc ata gtc ggt gat gga gec tgt 48 
Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 
15 10 15 

gga aag acc tgt ttg etc ata gtt ttc agt aaa gat cag ttt ccc gaa 96 
Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtc tac gtg ccg aca gtc ttc gag aac tac gtt gca gat ate gag gtc 144 
Val Tyr Val Pro .Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

gat tea aaa cag gtt gag ctt gec tta tgg gat act get gga cag gag 192 
Asp Ser Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gac tat gat egg tta aga ccc etc tec tat cct gac aca gat gtt att 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

etc atg tgc ttc tec ate gac agt cct gat age ttg gaa aat ate cca 
Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gaa aaa tgg aca cca gag gtg aag cat ttc tgt ccc aat gtt ccc ate 336 
Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 HO 

ate etc gtg ggt aac aaa aag gat etc egg aat gat gag cac aca cga 384 
He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 " 120 125 

agg gaa ctt gec aaa atg aaa cag gag ccg gta aag cca gaa gaa ggg 432 
Arg Glu Leu Ala Lys Met Lys Gin Glu Pro Val Lys Pro Glu Glu Gly 

130 " 135 140 

cga gac atg gee aac cga ate aat gec ttt ggt tat eta gag tgt teg 480 
Arg Asp Met Ala Asn Arg He Asn Ala Phe Gly Tyr Leu Glu Cys Ser 
145 150 155 160 

gec aaa aca aaa gat ggc gtg aga gaa gtc ttt gaa atg gee acc agg 528 
Ala Lys Thr Lys Asp Gly Val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 170 175 

gcg gcg ctg caa gec aga aaa cgt ggt aaa aag age ggc tgc ctg ctg 576 
Ala Ala Leu Gin Ala Arg Lys Arg Gly Lys Lys Ser Gly Cys Leu Leu 
180 185 190 

tta taa 
Leu 



288 



582 
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<210> 184 
<211> 193 
<212> PRT 

<213> Brachydanio rerio 
<400> 184 

Met Ala Ala He Arg Lys Lys Leu Val He' Val Gly Asp Gly Ala Cys 

15 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 " 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Ser Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 110 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 120 125 

Arg Glu Leu Ala Lys Met Lys Gin Glu Pro Val Lys Pro Glu Glu Gly 

130 135 140 

Arg Asp Met Ala Asn Arg He Asn Ala Phe Gly Tyr Leu Glu Cys Ser 
145 150 155 " 160 

Ala Lys Thr Lys Asp Gly Val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Ala Arg Lys Arg Gly Lys Lys Ser Gly Cys Leu Leu 
180 185 190 

Leu 

<210> 185 
<211> 582 
<212> DNA 

<213> Brachydanio rerio 

<220> 

<221> CDS 

<222> (1) . . (582) 

<400> 185 

atg gcg get ate agg aaa aag ctg gtg ate gtg gga gat gga gca tgt 48 

Met Ala Ala lie Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 10 15 

ggg aag ace tgt eta etc ata gtg ttc age aaa gac cag ttt cca gaa 96 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtc tac gtg cct act gtg ttc gag aac tac att get gac att gaa gtc 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr He Ala Asp He Glu Val 

35 40 45 

gac age aaa cag gtg gag ctg gca ttg tgg gac aca gca gga cag gag 192 
Asp Ser Lys Gin Val Glu Leu Ala Leu Trp Asp. Thr Ala Gly Gin Glu 

50 55 60 

gac tat gac cgt etc aga cct ctg tct tac cca gac aca gat gtc ate 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

etc atg tgc ttc tec ata gac agt ccc gac agt tta gag aat ate cca 288 
Leu Met Cys Phe Ser He Asp: Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gaa aag tgg acg ccg gag gta aag cac ttc tgc ccc aac gtt ccc ata 336 
Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 110 

ate ctg gtg ggc aat aag aga gat ctg cgt act gat gag aac aca egg 384 
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lie 


Leu 


Val 


Gly 


Asn 


Lys 






115 








agg 


gag 


ctg 


act 


aag 


atg 


Arg 


Glu 


Leu 


Thr 


Lys 


Met 




130 










agg 


gac 


atg 


get 


aac 


cgc 


Arg 


Asp 


Met 


Ala 


Asn 


Arg 


145 










150 


get 


aag 


act 


aag 


gat 


ggc 


Ala 


Lys 


Thr 


Lys 


Asp 


Gly 










165 




gcg 


gcg 


ctg 


cag 


gtt 


cgc 


Ala 


Ala 


Leu 


Gin 


val 


Arg 








180 






ttg 


tga 










Leu 
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Arg Asp Leu Arg Thr Asp 
120 

aag cag gag cca gtt aaa 

Lys Gin Glu Pro Val Lys 

135 140 

att age gec ttc ggc tac 

lie Ser Ala Phe Gly Tyr 
155 

gtg agg gaa gtt. ttt gaa 

val Arg Glu Val' Phe Glu 
170 

aag agg aag aag agg age 

Lys Arg Lys Lys Arg Ser 
185 
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Glu Asn Thr Arg 
125 

at a gaa gag ggc 432 
lie Glu Glu Gly 

ctg gaa tgc tea 480 
Leu Glu Cys Ser 
160 

atg gee ace aga 528 
Met Ala Thr Arg 
175 

ggg tgc tea ctg 576 
Gly Cys Ser Leu 
190 

582 



<210> 186 
<211> 193 
<212> PRT 

<213> Brachydanio rerio 



<400> 186 



Met 


Ala 


Ala 


He 


Arg 


Lys 


Lys Leu Val He Val Gly Asp 


Gly Ala Cys 


1 








5 


10 




Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He Val Phe Ser Lys Asp Gin 


Phe Pro Glu 




20 






25 


30 


Val 


Tyr 


Val 


Pro 


Thr 


Val 


Phe Glu Asn Tyr He Ala Asp 


He Glu Val 




35 








40 45 




Asp 


Ser 


Lys 


Gin 


Val 


Glu 


Leu Ala Leu Trp Asp Thr Ala '.Gly Gin Glu 


50 








55 60 


i 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro Leu Ser Tyr Pro Asp Thr 


Asp Val He 


65 










70 


75 


80 


Leu 


Met 


Cys 


Phe 


Ser 


He 


Asp Ser Pro Asp Ser Leu Glu 


Asn He Pro 








85 




90 


95 


Glu 


Lys 


Trp 


Thr 


Pro 


Glu 


Val Lys His Phe Cys Pro Asn 


Val Pro He 




100 






105 


110 


He 


Leu 


Val 


Gly 


Asn 


Lys 


Arg Asp Leu Arg Thr Asp Glu 


Asn Thr Arg 






115 








120 125 




Arg 


Glu 


Leu 


Thr 


Lys 


Met 


Lys Gin Glu Pro Val Lys He 


Glu Glu Gly 


130 










135 140 




Arg 


Asp 


Met 


Ala 


Asn 


Arg 


He Ser Ala Phe Gly Tyr Leu 


Glu Cys Ser 


145 








150 


155 


160 


Ala 


Lys 


Thr 


Lys 


Asp 


Gly 


Val Arg Glu Val Phe Glu Met 


Ala Thr Arg 








165 




170 


175 


Ala 


Ala 


Leu 


Gin 


Val 


Arg 


Lys Arg Lys Lys Arg Ser Gly 


Cys Ser Leu 








180 






185 


190 



Leu 



<210> 187 
<211> 806 
<212> DNA 

<213> Ciona intestinalis 

<220> 

<221> CDS 

<222> (1) . . (750) 

<400> 187 

atg gaa agg ata ccc aat gac gtc aaa ggt get ttc ccc 

Met Glu Arg He Pro Asn Asp Val Lys Gly Ala Phe Pro 
15 10 

ccc aaa att aaa tgt gtt ata gtt ggg gat ggt gga tgc 

Pro Lys He Lys Cys Val He Val Gly Asp Gly Gly Cys 

20 25 



gag cag caa 48 
Glu Gin Gin 
15 

ggt aaa act 96 

Gly Lys Thr 

30 
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4- 

tea 


aug 


ctg 


gtt 


gta 


tat 


act 


gaa 


ggc 


gaa 


ttc 


cct 


gaa 


gtg 


tac 


acg 


T A A 

X44 


oer 


Met 


Leu 


vax 


val 


Tyr 


Thr 


f«l mm 


oxy 


trXU 


pne 


Pro 


CsXU 


vax 


Tyr 


Thr 








JO 








40 










45 








ccc 


acg 


gtg 


ttt 


gaa 


aat 


tac 


age 


aga 


gag 


gtg 


gaa 


tta 


aat 


cga 


aag 


192 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 


Ser 


Arg 


Glu 


Val 


Glu 


Leu 


Asn 


Arg Lys 






50 










55 










60 












aag 


gcg 


,—4- _ 

gtg 


eta 


aca 


ttg 


tgg 


gat 


acg 


gcg 


ggt 


cag 


gaa 


gat 


tat 


gac 


240 


Lys 


Ala 


Val 


Leu 


Thr 


Leu 


Tip 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr Asp 




65 










70 










75 










80 




egg 


Cut 


cga 


cct 


etc 


tct 


tac 


aac 


aac 


gca 


gac 


ate 


ata 


ate 


ate 


tgc 


288 


Arg 


Leu 


Arg 


Pro 


Leu 
85 


Ser 


Tyr 


Asn 


Asn 


Ala 
90 


Asp 


He 


He 


He 


He 
95 


Cys 




tat 


gac 


gtc 


aca 


aac 


ctt 


tea 


age 


ttc 


cag 


aac 


gta 


gaa 


ate 


cga 


tgg 


336 


Tyr 


Asp 


val 


Thr 
100 


Asn 


Leu 


Ser 


Ser 


Phe 
105 


Gin 


Asn 


Val 


Glu 


He 
110 


Arg 


Trp 




teg 


cca 


gaa 


ate 


cgc 


cat 


ttt 


tgc 


ccc 


cac 


acg 


cct 


acg 


atg 


ctg 


gtt 


384 


Ser. 


.Pro 


Glu 


lie. 


Arg 


His 


Phe 


Cys 


Pro 


His 


Thr. 


Pro 


Thr 


Met. 


Leu Val 








115 










120 










125 










gcg 


tgt 


aaa 


act 


gac 


ctg 


agg 


tct 


cag 


gtg 


aac 


teg 


gat 


tea 


att 


tea 


432 


Ala 


Cys 
130 


Lys 


Thr 


Asp 


Leu 


Arg 

135 


Ser 


Gin 


Val 


Asn 


Ser 
140 


Asp 


Ser 


He 


Ser 




aac 


tct 


atg 


act 


cga 


tct 


get 


tec 


atg 


gca 


acc 


aca 


tta 


aca 


aca 


act 


480 


Asn 


Ser 


Met 


Thr 


Arg 


Ser 


Ala 


Ser 


Met 


Ala 


Thr 


Thr 


Leu 


Thr 


Thr 


Thr 




145 








150 










155 










160 




gga 


gta 


gat 


tta 


aca 


gaa 


atg 


acg 


tea 


tct 


acg 


tta 


agt 


gta 


aat 


teg 


528 


Gly 


Val 


Asp 


Leu 


Thr 
165 


Glu 


Met 


Thr 


Ser 


Ser 
170 


Thr 


Leu 


Ser 


Val 


Asn 
175 


Ser 




gta 


cgc 


gca 


gaa 


gat 


gga 


aca 


gac 


aga 


cac 


gat 


gat 


caa 


gca 


ata 


gtt 


576 


Val 


Arg 


Ala 


Glu 
180 


Asp 


Gly 


Thr 


Asp 


Arg 
185 


His 


Asp 


Asp 


Gin 


Ala 
190 


He 


Val 




aca 


aca 


gaa 


atg 


ggc 


gag 


aag 


atg 


teg 


aaa 


cac 


ate 


aac 


gtc 


aat 


cat 


624 


Thr 


Thr 


Glu 


Met 


Gly 


Glu 


Lys 


Met 


Ser 


Lys 


His 


He 


Asn 


Val 


Asn 


His 








195 






200 










205 










ttt 


gtt 


gaa 


tgt 


tct 


gee 


aag 


gat 


ggc 


aca 


aac 


att 


gaa 


caa 


gta 


ttt 


672 


rne 


vax 
210 


/-» ~\ mm 


Cys 


Ser 


ax a. 


Lys 
215 


Asp 


VS-Ly 


Thr 


Asn 


He 
220 


Glu 


Gin 


Val 


Phe 




aaa 


ctt 


gca 


ate 


gaa 


tgc 


gtc 


att 


aaa 


cgt 


gaa 


gca 


aaa 


gtc 


ggt 


cca 


720 


Lys 


Leu 


Ala 


lie 


Glu 


Cys 


Val 


He 


Lys 


Arg 


Glu 


Ala 


Lys 


Val 


Gly Pro 




225 










230 










235 










240 




aaa 


aac 


tgc 


tgc 


tgt 


tgt 


ttg 


eta 


ttg 


tgatacgctg ttacgtaaat 




767 


Lys 


Asn 


Cys 


Cys 


Cys 


Cys 


Leu 


Leu 


Leu 



















245 

aaattgactg tgtaggaaat gectaacaaa teaatcgeg 806 



<210> 188 
<211> 249 
<212> PRT 

<213> Ciona intestinalis 



<400> 188 



Met 


Glu 


Arg 


lie 


Pro 


Asn 


Asp 


Val 


Lys 


Gly 


Ala 


Phe 


Pro Glu Gin Gin 


1 








5 










10 






15 


Pro 


Lys 


He 


Lys 


Cys 


Val 


He 


Val 


Gly 


Asp 


Gly Gly Cys Gly Lys Thr 








20 










25 








30 


Ser 


Met 


Leu 


Val 


Val 


Tyr 


Thr 


Glu 


Gly 


Glu 


Phe 


Pro 


Glu Val Tyr Thr 






35 










40 










45 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 


Ser 


Arg 


Glu 


Val 


Glu 


Leu Asn Arg Lys 




50 










55 










60 




Lys 


Ala 


Val 


Leu 


Thr 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly Gin Glu Asp Tyr Asp 


65 










70 








75 




80 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Asn 


Asn 


Ala 


Asp 


He 


He lie He Cys 










85 










90 




95 


Tyr 


Asp 


Val 


Thr 


Asn 


Leu 


Ser 


Ser 


Phe 


Gin 


Asn 


Val 


Glu He Arg Trp 








100 










105 








110 


Ser 


Pro 


Glu 


He 


Arg 


His 


Phe 


Cys 


Pro 


His 


Thr 


Pro 


Thr Met Leu Val 
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115 










120 


Ala 


Cys 
130 


Lys 


Thr 


Asp 


Leu 


Arg 
135 


Ser 


Asn 


Ser 


Met 


Thr 


Arg 


Ser 


Ala 


Ser 


145 










150 






Gly 


Val 


Asp 


Leu 


Thr 


GlU 


Met 


Thr 








165 








Val 


Arg 


Ala 


Glu 
180 


Asp 


Gly 


Thr 


Asp 


Thr 


Thr 


Glu 
195 


Met 


Gly 


Glu 


Lys 


Met 
200 


Phe 


Val 


Glu Cys 


Ser 


Ala 


Lys 


Asp 




210 










215 




Lys 


Leu 


Ala 


lie 


Glu 


Cys 


Val 


He 


225 










230 






Lys 


Asn 


Cys 


Cys 


Cys 
245 


Cys 


Leu 


Leu 



<210> 189 
<211> 726 
<212> DNA 

<213> Ciona intestinalis 



125 



GXn 


vax 


7V ein 

Asn 


ser Asp ser xxe oer 










Mec 


ax a 


Txir 


inr Lieu inr xxjx xxxx 






T C C 


xo u 


Ser 


Ser 


Thr 


Leu ser vax ash osr 




x / u 




X / w 


Arg 


HIS 


Asp 


Asp Gin Ala xxe vax 


185 






190 


Ser 


Lys' 


His 


He Asn Val Asn His 








205 


Gly 


Thr 


Asn 


He Glu Gin Val Phe 








220 


Lys 


Arg 


Glu 


Ala Lys Val Gly Pro 






235 


240 


Leu 









<220> 

<221> CDS 

<222> (14) . . (598) 



<400> 189 

ttgagttcat gaa atg cag tct at a aaa ttg gtc gtg gtg ggc gat ggt 49 
Met Gin Ser He Lys Leu Val Val Val Gly Asp Gly 
1 '5 10 



gcc 


gtg 


gga 


aaa 


acg 


tgt 


eta 


ctt 


ate 


age 


tac 


acg 


gcg 


aat 


gcc 


ttt 


97 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Ala 


Asn 


Ala 


Phe 








15 










20 










25 










cca 


agg 


gag 


tat 


gtt 


ccg 


acc 


gtg 


ttt 


gaa 


aac 


tac 


atg 


gca 


aat 


ate 


145 


Pro 


Arg 


Glu 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 


Met 


Ala 


Asn 


He 






30 








35 










40 












acg 


gtc 


aac 


aat 


cag 


caa 


att 


tgc 


tta 


agt 


ctt 


tgg 


gat 


acc 


get 


ggt 


193 


Thr 


val 


Asn 


Asn 


Gin 


Gin 


He 


Cys 


Leu 


Ser 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 




45 










50 










55 










60 




caa 


gag 


gat 


ttt 


gac 


agg 


ttg 


aga 


ccg 


ctt 


tea 


tat 


cca 


gac 


acc 


gat 


241 


Gin 


Glu 


Asp 


Phe 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Asp 


Thr 


Asp 












65 










70 










75 






gtc 


ttt 


gtt 


etc 


tgc 


ttc 


teg 


ata 


att 


tct 


cca 


ace 


tec 


ttt 


gaa 


aat 


289 


val 


Phe 


Val 


Leu 


Cys 


Phe 


Ser 


He 


He 


Ser 


Pro 


Thr 


Ser 


Phe 


Glu 


Asn 










80 










85 










90 








etc 


cag 


cac 


aaa 


tgg 


ttg 


ccc 


gag 


tta 


cga 


gaa 


cat 


tgt 


cct 


aat 


gtg 


337 


Leu 


Gin 


His 


Lys 


Trp 


Leu 


Pro 


Glu 


Leu 


Arg 


Glu 


His 


Cys 


Pro 


Asn 


val 








95 










100 










105 










ccc 


ate 


ctg 


eta 


gtg 


ggt 


acg 


aag 


ctt 


gac 


ttg 


aga 


gaa 


gac 


acc 


gaa 


385 


Pro 


He 


Leu 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Glu 


Asp 


Thr 


Glu 






110 










115 










120 












att 


eta 


caa 


cag 


ctt 


tec 


teg 


aaa 


aat 


eta 


aaa 


cca 


ata 


aca 


cca 


gaa 


433 


He 


Leu 


Gin 


Gin 


Leu 


Ser 


Ser 


Lys 


Asn 


Leu 


Lys 


Pro 


He 


Thr 


Pro 


Glu 




125 










130 










135 










140 




gaa 


ggg 


gcg 


aaa 


atg 


gcg 


aaa 


gat 


att 


aaa 


gca 


gtt 


aaa 


tat 


tta 


gag 


481 


Glu 


Gly 


Ala 


Lys 


Met 


Ala 


Lys 


Asp 


He 


Lys 


Ala 


Val 


Lys 


Tyr 


Leu 


Glu 












145 










150 










155 






tgt 


tct 


get 


eta 


act 


cag 


gag 


tgt 


etc 


age 


caa 


gta 


ttt 


gat 


gac 


get 


529 


Cys 


Ser 


Ala 


Leu 


Thr 


Gin 


Glu 


Cys 


Leu 


Ser 


Gin 


Val 


Phe 


Asp 


Asp 


Ala 








160 










165 










170 








gtt 


att 


gca 


gtg 


eta 


aac 


cca 


tea 


cac 


ttt 


tea 


agt 


aac 


aat 


gac 


aac 


577 


Val 


He 


Ala 


val 


Leu 


Asn 


Pro 


Ser 


His 


Phe 


Ser 


Ser 


Asn 


Asn 


Asp 


Asn 





175 180 185 



age tgt tgt aag ate gtt taaatttaca tctcgttgtt tacttacagt 625 
Ser Cys Cys Lys He Val 
190 

gttatgtgta ttaacattac cgtgatacac acacgagttc atagggttaa aaacgagagt 685 
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tatattttct ttaaaattag ataaacactc attcccgtca a 726 



<210> 190 
<211> 194 
<212> PRT 

<213> Ciona intestinalis 
<400> 190 



Met 


Gin 


Ser He 


Lys 


Leu 


Val 


Val 


val 


Gly 


Asp Gly Ala 


Val Gly Lys 


1 






5 










10 




15 


Thr 


Cys 


Leu Leu 


He 


Ser 


Tyr 


Thr 


Ala 


Asn 


Ala Phe Pro 


Arg Glu Tyr 


Val 




20 


Phe 








25 






30 


Pro 


Thr Val 


Giu 


Asn 


Tyr 


Met 


Ala 


Asn He Thr 


Val A.qti Asn 






35 








40 






45 




Gin 


Gin 


He Cys 


Leu 


Ser 


Leu 


Trp Asp 


Thr 


Ala Gly Gin 


Glu Asp Phe 




50 








55 








60 


Asp 


Arg 


Leu Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Asp 


Thr Asp Val 


Phe Val Leu 
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70 










75 


80 


Cys 


Phe 


Ser He 


He 


Ser 


Pro 


Thr 


Ser 


Phe 


Glu Asn Leu 


Gin His Lys 








85 










90 




95 


Trp 


Leu 


Pro Glu 


Leu 


Arg 


Glu 


His 


Cys 


Pro 


Asn Val Pro 


He Leu Leu 






100 










105 






110 m 


Val 


Gly 


Thr Lys 


Leu 


Asp 


Leu 


Arg 


Glu 


Asp 


Thr Glu He 


Leu Gin Glii 






115 








120 






125 




Leu 


Ser 


Ser Lys 


Asn 


Leu 


Lys 


Pro 


He 


Thr 


Pro Glu Glu 


Gly Ala Lys 




130 








135 








140 


Met 


Ala 


Lys Asp 


He 


Lys 


Ala 


Val 


Lys 


Tyr 


Leu Glu Cys 


Ser Ala Leu 


145 








150 










155 


160 


Thr 


Gin 


Glu Cys 


Leu 


Ser 


Gin 


Val 


Phe 


Asp 


Asp Ala Val 


He Ala Val 








165 










170 


175 


Leu 


Asn 


Pro Ser 


His 


Phe 


Ser 


Ser 


Asn 


Asn 


Asp Asn Ser 


Cys Cys Lys 






180 










185 






190 


lie 


Val 





















<210> 191 
<211> 827 
<212> DNA 

<213> Ciona intestinalis 

<220> 

<221> CDS 

<222> (34) . . (615) 

<400> 191 

atttgtaagg acgagaacca acacaacaaa ata atg cag tec gtg aaa tgt gtt 54 

Met Gin Ser Val Lys Cys Val 



gtg 


gtt 


gga 


gat 


gga 


get 


gtt 


ggt 


aaa 


1 
acc 


tgt 


ttg 


ctg 


5 
ate 


age 


tac 
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Val 


val 


Gly 
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Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 
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acc 


aat 


get 


ttc 


cct 


gga 


gaa 


tat 


att 


ccc 


act 


gtg 


ttt 


gaa 


age 


150 


Thr 


Thr 


Asn 


Ala 


Phe 


Pro 


Gly 


Glu 


Tyr 


He 


Pro 


Thr 


Val 


Phe 


Glu 


Ser 






25 










30 










35 












tat 


get 


gca 


aat 


gtt 


gtg 


gtg 


gat 


ggg 


aac 


cca 


gtg 


aac 


ate 


ggt 


tta 


198 


Tyr 


Ala 


Ala 


Asn 


Val 


Val 


Val 


Asp 


Gly 


Asn 


Pro 


val 


Asn 


He 


Gly 


Leu 




40 










45 










50 








55 




tgg 


gac 


aca 


get 


ggc 


cag 


gaa 


gat 


tac 


gat 


aaa 


ctt 


cga 


ccg 


eta 


tea 


246 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asp 


Lys 


Leu 


Arg 


Pro 


Leu 


Ser 












60 
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70 






tac 


cca 


cag 


tea 


gat 


gtc 


ttt 


gtc 


atg 


tgc 


ttc 


teg 


ttg 


gta 


aac 


cca 


294 


Tyr 


Pro 


Gin 


Ser 


Asp 


Val 


Phe 


Val 


Met 


Cys 


Phe 


Ser 


Leu 


Val 


Asn 


Pro 
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cLCcl 


LCt 




gaa 


aac 


att 


gca 


gaa 


aaa 




tat 


ccg 


y CLCL 


gtt 


eat 


gaa 


342 




ber 


Tyr 

an 


UlU 


noil 


Ale 


rU-o. 


v?XU 
Q C 


Lys 


Trp 


ryx 


riu 


JL\J\J 


VCL-L 


His 


Glu 




cac 


tgt 


ccc 


rra f» 


at*ri 




all 








rrrtrt 

yyy 


a r^a 


aaa 

CLCLCL 




gat 


tta 


390 


His 


Cys 

105 


Pro 


Asp 


He 


Pro 


He 
Tin 


val 


Leu 


Val 


Gly 


Thr 

JLJ.O 


Lys 


Leu 


Asp 


Leu 




Cyt 


gaa 


gac 


cca 


gaa 


aca 


etc 


aaa 


acg 


tug 


ggt 


gag 


a rra 
cxy d 


aa t- 
clgLU 


Lit 






Arg 


Glu 


Asp 


Pro 


Glu 


Thr 


Leu 


Lys 


Thr 


Leu 


Gly 


Glu 


Arg 


Asn 


Leu 


Val 




120 










l^sb 










xiU 














cca 


get 


acc 


aaa 


acc 


cag 




etc 


cag 


ut.g 


gcg 


aaa 


aaa 




srsjg" 


r*Yf"<a 

yea 


ft OD 


Pro 


val 


Thr 


Lys 


Thr 

1 A f\ 

140 




vjiy 


Leu 


Gm 


Leu 

145 


Ala 


Lys 


Lys 


vai 


vaiy 
loU 


iud 




aag 


aaa 


tat 


tut 


gag 


tgc 


teg 


gcg 


CtL 


act 


cgt 


gaa 


aac 


tta 


gac 


gaa 




Lys 


Lys 


Tyr 


Phe 
155 


Glu 


Cys 


Ser 


Ala 


Leu 
160 


Thr 


Arg 


GlU 


Asn 


Leu 
165 


Asp 


GlU 




etc 


ttt 


ctt 


gaa 


gca 


atg 


cgc 


aat 


gee 


ctg 


aaa 


gga 


ccg 


gac 


aaa 


att 


582 


Leu 


Phe 


Leu 


Glu 


Ala 


Met 


Arg 


Asn 


Ala 


Leu 




Gly 


Pro 


Asp 


Lys 


He 








170 










175 










180 










gta 


aaa 


gag 


aaa 


cca 


aat 


tgt 


eaa 


att 


ate 


taatcagcat ttaagaattt 


632 


Val 


Lys 
185 


Glu 


Lys 


Pro 


Asn 


Cys 
190 


Gin 


He 


He 
















tgagttaacg ctcgaataat tgttcagtat acaactgtat 


aecaaagtce ataaaaataa 


692 



aaattaaaac tatagacttt gctgtacaca ttatacatgt cgttaccgca tgtggtgtag 752 



cttaaacgat tgtgcaatac ctttatatta aaggttaatg ttttacagtc gtgagtaact 812 
ggaactctct gcgta 827 



<210> 192 
<211> 193 
<212> PRT 

<213> Ciona intestinalis 



<400> 192 
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Gin 


Ser 


Val 


Lys 


Cys 


Val 


Val 


Val 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 


Lys 
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Thr 


Cys 


Leu 


Leu 
20 


He 


Ser 


Tyr 


Thr 


Thr 
25 


Asn 


Ala 


Phe 


Pro 


Gly 
30 


Glu 


Tyr 


He 


Pro 


Thr 

35 


Val 


Phe 


Glu 


Ser 


Tyr 
40 


Ala 


Ala 


Asn 


Val 


Val 
45 


Val 


Asp 


Gly 


Asn 


Pro 
50 


Val 


Asn 


lie 


Gly 


Leu 
55 


Trp 


Asp 


Thr 


Ala 


Gly 
60 


Gin 


Glu 


Asp 


Tyr 


Asp 


Lys 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin 


Ser 


Asp 


Val 


Phe 


Val 


Met 


65 










70 










75 










80 


Cys 


Phe 


Ser 


Leu 


Val 


Asn 


Pro 


Thr 


Ser 


Tyr 


Glu 


Asn 


He 


Ala 


Glu 


Lys 








85 










90 










95 




Trp 


Tyr 


Pro 


Glu 


Val 


His 


Glu 


His 


Cys 


Pro 


Asp 


He 


Pro 


He 


Val 


Leu 




100 










105 










110 






Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Glu 


Asp 


Pro 


Glu 


Thr 


Leu 


Lys 


Thr 




115 










120 










125 








Leu 


Gly 


Glu 


Arg 


Asn 


Leu 


Val 


Pro 


Val 


Thr 


Lys 


Thr 


Gin 


Gly 


Leu 


Gin 




13 0 








135 










140 










Leu 


Ala 


Lys 


Lys 


Val 


Gly 


Ala 


Lys 


Lys 


Tyr 


Phe 


Glu 


Cys 


Ser 


Ala 


Leu 


145 










150 










155 










160 


Thr 


Arg 


Glu 


Asn 


Leu 


Asp 


Glu 


Leu 


Phe 


Leu 


Glu 


Ala 


Met 


Arg 


Asn 


Ala 








165 








170 










175 




Leu 


Lys 


Gly 


Pro 
180 


Asp 


Lys 


He 


Val 


Lys 
185 


Glu 


Lys 


Pro 


Asn 


Cys 
190 


Gin 


He 



He 
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<210> 193 
<211> 718 
<212> DNA 

<213> Ciona intestinalis 

' <220> 
<221> CDS 
<222> (6) . . (584) 

<400> 193 

aagat atg caa gcg ata aag tgc gtt gtg gtt gga gat gga get gtt ggt 50 
Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly 
1 "5 10 is 

aaa acc tgt ttg ctg ate age tac aca act aac get ttc cct gga gaa 98 
Lys Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu 

20 25 30 

tat att ccc act gtg ttt gat aac tac tct gec aat gtc atg gta gat 146 
Tyr lie Pro Thr" Val Phe Asp" Asn Tyr Ser Ala" Asn Val Met Val Asp 

35 40 45 

ggc cgc cct gtc aac ttg gga tta tgg gat aca gca gga cag gag gat 194 
Gly Arg Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp 

50 55 60 

tat gac aga etc cga cct etc tec tac cca caa acc gat gtt ttt etc 242 
Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu 

65 70 75 

att tgt ttc tct gtg get tct ccc get tec tac gaa aac gtg cgc gca 290 
lie Cys Phe Ser Val Ala Ser Pro Ala Ser Tyr Glu Asn Val Arg Ala 
80 85 90 95 

aag tgg cap ccg gag gtc gca cac cac tgc ccg gaa acg ccc gta ctt 338 
Lys Trp His Pro Glu Val Ala His His Cys Pro Glu Thr Pro Val Leu 

100 105 110 

etc gtg gga aca aaa ctt gat tta cgt gac gat gcg gac act gtg aac 386 
Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Ala- Asp Thr Val Asn 

115 120 125 

aag eta get gag aaa aag etc tec acc att act act acc caa ggt tta 434 
Lys Leu Ala Glu Lys Lys Leu Ser Thr lie Thr Thr Thr Gin Gly Leu 

130 135 140 

caa atg gcg aag gaa ctg ggg gcg gtt aaa tac caa gag tgc tct get 482 
Gin Met Ala Lys Glu Leu Gly Ala Val Lys Tyr Gin Glu Cys Ser Ala 

145 150 155 

ctg acg caa aag ggg ctg aaa aat gtt ttc gac gaa gcg att egg gcg 530 
Leu Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Arg Ala 
160 165 170 175 

gtt ctt aac cca aca aga aga gtc gtc cga acg aag aac tgc gaa att 578 
Val Leu Asn Pro Thr Arg Arg Val Val Arg Thr Lys Asn Cys Glu He 

180 185 190 

eta tgattacatt gacatgtaga gcggcctcga acgtttacaa taatttggaa 631 
Leu 

ttatgttcct atttaataaa tggagttcgg tggctttata tattctctct tategtttta 691 



agcattttaa aategtcget accgtgt 718 



<210> 194 
<211> 192 
<212> PRT 

<213> Ciona intestinalis 
<400> 194 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 
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35 40 45 

Arg Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu lie 
65 " 70 75 80 

Cys Phe Ser Val Ala Ser Pro Ala Ser Tyr Glu Asn Val Arg Ala Lys 

85 90 95 

Trp His Pro Glu Val Ala His His Cys Pro Glu Thr Pro Val Leu Leu 

100 105 110 

Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Ala Asp Thr Val Asn Lys 

115 120 125 

Leu Ala Glu Lys Lys Leu Ser Thr lie Thr Thr Thr Gin Gly Leu Gin 

130 " 135 140 

Met Ala Lys Glu Leu Gly Ala Val Lys Tyr Gin Glu Cys Ser Ala Leu 
145 * 150 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala lie Arg Ala Val 

165 170 175 

Leu Asn "Pro Thr Arg Arg Val Val Arg Thr Lys Asn Cys Glu He Leu 
180 " ~ 185 190 



<210> 195 
<211> 579 
<212> DNA 

<213> Ciona intestinalis 



<220> 

<221> CDS 

<222> (1) . . (579) 



<400> 195 








atg 


cag 


gcg 


ate 


aag 


tgt 


Met 


Gin 


Ala 


He Lys 


Cys 


1 








5 




acc 


tgt 


ttg 


ctg ate 


age 


Thr 


Cys 


Leu 


Leu 


He 


Ser 








20 






att 


ccc 


act 


gtg 


ttt 


gat 


He 


Pro 


Thr 


Val 


Phe 


Asp 






35 








cgc 


cct 


gtc 


aac 


ttg 


gga 


Arg 


Pro 


Val 


Asn 


Leu 


Gly 




50 










gac 


aga 


ctt 


egg 


cct 


etc 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


65 










70 


tgt 


ttc 


tea 


etc 


gtg 


tea 


Cys 


Phe 


Ser 


Leu Val 


Ser 








85 




tgg 


tat 


cca 


gaa gtc 


gca 


Trp 


Tyr 


Pro 


Glu 


Val 


Ala 








100 






gtg 


gga 


aca 


aaa 


ctt 


gat 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 






115 








ctg 


aaa 


gaa 


aag 


aaa 


ctt 


Leu Lys 


Glu 


Lys 


Lys 


Leu 




130 










atg 


gcg 


aaa 


gaa 


gtg 


aac 


Met 


Ala 


Lys 


Glu 


val 


Asn 


145 










150 


acc 


caa 


aaa 


ggc 


ctg 


aaa 


Thr 


Gin 


Lys 


Gly Leu 


Lys 










165 




ctg 


tgc 


cct 


gag 


caa 


aag 


Leu Cys 


Pro 


Glu 


Gin 


Lys 








180 






tga 













gtt gtg gtt gga gat gga 
Val Val Val Gly Asp Gly 
10 

tac aca acc aat get ttc 
Tyr Thr Thr Asn Ala Phe 
25 

aac tac tct gee aat gtc 
Asn Tyr Ser Ala Asn Val 
40 

tta tgg gat aca gca ggg 
Leu Trp Asp Thr Ala Gly 
55 60 
tec tac cca caa acg gac 
Ser Tyr Pro Gin Thr Asp 
75 

cca gec tea ttc gaa aac 
Pro Ala Ser Phe Glu Asn 
90 

cac cac tgt cca gat aca 
His His Cys Pro Asp Thr 
105 

tta cgt gac gac cag gaa 
Leu Arg Asp Asp Gin Glu 
120 

gee cca ate etc tac cca 
Ala Pro He Leu Tyr Pro 
135 140 
gee gta aag tac ctg gag 
Ala Val Lys Tyr Leu Glu 
155 

acc gtt ttc gac gag gcg 
Thr Val Phe Asp Glu Ala 
170 

ccg aag aaa aag aag ccc' 
Pro Lys Lys Lys Lys Pro 
185 



get gtt ggt aaa 48 
Ala Val Gly Lys 
15 

cct gga gaa tat 96 
Pro Gly Glu Tyr 
30 

atg gta gat ggc 144 

Met Val Asp Gly 

45 

cag gag gat tac 192 
Gin Glu Asp Tyr 

gtt ttt ctg ate 240 
Val Phe Leu He 
80 

gtg cgc gca aag 2 88 

Val Arg Ala Lys 
95 

ccg gtc att etc 336 
Pro Val He Leu 
110 

acc ate caa aag 384 

Thr He Gin Lys 

125 

caa ggg ttg cag 432 
Gin Gly Leu Gin 

tgc teg get etc 480 
Cys Ser Ala Leu 
160 

ate cgc gec gtc 528 
He Arg Ala Val 
175 

tgc gag ctt ttg 576 
Cys Glu Leu Leu 
190 

579 



WO 2005/014828 



153/291 



PCT/EP2004/008136 



<210> 196 
<211> 192 
<212> PRT 

<213> Ciona intestinalis 
<400> 196 

Met Gin Ala He Lys Cys Val Val Val Gly* Asp Gly Ala Val Gly Lys 

15 io 15 

Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

Arg Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Levi Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

Trp Tyr Pro Glu Val Ala His His Cys Pro Asp Thr Pro Val He Leu 

100 105 110 

Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Gin Glu Thr He Gin Lys 

115 120 125 

Leu Lys Glu Lys Lys Leu Ala Pro He Leu Tyr Pro Gin Gly Leu Gin 

130 135 "* 140 

Met Ala Lys Glu Val Asn Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

Leu Cys Pro Glu Gin Lys Pro Lys Lys Lys Lys Pro Cys Glu Leu Leu 
180 185 " 190 

<210> 197 
<211> 727 
<212> DNA 

<213> Ciona intestinalis 

<220> 

<221> CDS 

<222> (1) . . (582) 

<400> 197 

atg get tea att cgt aag aag ctg gta at a gtt ggt gat ggt get tgt 48 

Met Ala Ser He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 10 15 

ggt aaa act tgc tta etc ate gtg ttt age aaa gat caa ttc cca gaa 96 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtt tac gtc cca act gtt ttt gaa aac tat gtg get gat att gaa gta 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

gac tct aaa cag gtt gag ctt get ttg tgg gat aca get ggt caa gaa 192 
Asp Ser Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tac gac agg ctt cgt cca ctt tec tac ccc gat act gat gtt att 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

ctt atg tgt ttc tea ate gac age cca gat tea ctt gag aac att ccc 288 
Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gaa aaa tgg ace cct gag gta: agg cat ttt tgc cca agt gtt cca ate 336 
Glu Lys Trp Thr Pro Glu Val Arg His Phe Cys Pro Ser Val Pro He 

100 " 105 110 

att ttg gtt gga aac aaa aaa gat ctt cgt aac gac agt tea aca ata 3 84 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Ser Ser Thr lie 
115 120 125 



480 



528 
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aag gaa ctt get aaa atg aaa caa tct gec gtt agt aac gaa gat gga 432 
Lys Glu Leu Ala Lys Met Lys Gin Ser Ala Val Ser Asn Glu Asp Gly 

130 135 140 

atg gcg atg gca gag aag att ggc get tac gga tat atg gaa tgt tea 
Met Ala Met Ala Glu Lys lie Gly Ala Tyr Gly Tyr Met Glu Cys Ser 
145 150 155 160 

gcg cgc ace aaa gaa ggt gtt egg gaa gtg ttt .gag ctt gca act aaa 
Ala Arg Thr Lys Glu Gly Val Arg Glu Val Phe Glu Leu Ala Thr Lys 

165 170 175 

gca get tta caa acc aag aaa aga aag aaa' aag agt gga tgt gaa gtc 576 
Ala Ala Leu Gin Thr Lys Lys Arg Lys Lys Lys Ser Gly Cys Glu Val 

180 185 190 

ttg taaagaaatt ttaactggag tctagcaatt gacttaacat tggacgcatg 629 
Leu 

ctttcatact ccccaacaaa ttaatattta caatgttgga aaegtattet gcatgacttg 689 
aactcaatta ctcaaataga tggcaggatt tatacaca 727 



<210> 198 
<211> 193 
<212> PRT 

<213> Ciona intestinalis 

<400> 198 n _ 

Met Ala Ser He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 
1 5 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

"20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Ser Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 ~ 70 75 80 

Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Arg His Phe Cys Pro Ser Val Pro He 

100 105 HO 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Ser Ser Thr He 

115 * 120 125 

Lys Glu Leu Ala Lys Met Lys Gin Ser Ala Val Ser Asn Glu Asp Gly 

130 " 135 140 

Met Ala Met Ala Glu Lys He Gly Ala Tyr Gly Tyr Met Glu Cys Ser 
145 150 155 160 

Ala Arg Thr Lys Glu Gly Val Arg Glu Val Phe Glu Leu Ala Thr Lys 

165 170 175 

Ala Ala Leu Gin Thr Lys Lys Arg Lys Lys Lys Ser Gly Cys Glu Val 
180 185 190 

Leu 



<210> 199 
<211> 714 
<212> DNA 

<213> Trichomonas vaginalis 

<220> 

<221> CDS 

<222> (11).. (613) 



<400> 199 

atttegctag atg aag cat ate aaa tgc gtt gtt gtc ggt gat ggt gca 
Met Lys His He Lys Cys Val Val Val Gly Asp Gly Ala 



49 
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1 5 10 



gtc 


ggt 


aag 


act 


tgc 


ctt ctt ate tea 


ttc 


act aca aat get ttc 


cca 


97 


Vai 


Gly 


Lys 


Thx 


Cys 


Leu Leu He Ser 


Phe 


Thr Thr Asn Ala Phe 


Pro 






15 








20 




25 






ggt 


gaa 


tat 


ate 


cca 


aca gtt ttc gat 


aac 


tac tct gca aac gtt 


atg 


145 


Gly 


Glu 


Tyr 


He 


Pro 


Thr Val Phe Asp- Asn 


Tyr Ser Ala Asn Val 


Met 




30 










35 




40 


45 




get 


gaa 


gga 


cat 


cct 


cca gtc aac etc 


cag 


ctt tgg gat aca get 


ggc 


193 


Ala 


Glu 


Gly His 


Pro 


Pro Val Asn Leu 


Gin 


Leu Trp Asp Thr Ala 


Gly 












50 




55 


60 




cag 


gaa 


gac 


tac 


aag 


aag ctt cgc cca 


ctt 


tct tac cct caa aca 


gat 


241 


Gin 


Glu 


Asp Tyr Lys 


Lys Leu Arg Pro 


Leu 


Ser Tyr Pro Gin Thr 


Asp 










65 




70 




75 




gtt 


ttc 


etc 


etc 


tgt 


ttc tct tta gtt 


tat 


cca get tct etc gag 


aac 


289 


Val 


Phe 


Leu 


Leu 


Cys 


Phe Ser Leu Val 


Tyr 


Pro Ala Ser Leu Glu 


Asn 








80 






85 




90 






ate 


gag 


acc 


atg 


tgg 


ate aaa gaa ate 


aag 


gaa tac tgc cca gac 


aa JET 


337 


lie 


"Glu 


Thr 


Met 


Trp 


He Lys Glu He Lys 


Glu Tyr Cys Pro Asp 


Lys 






95 








100 




105 




cca 


tac 


ate 


ctt 


gtc 


ggt ctt aag tec 


gat 


ctt cgt gat gaa ttc 


gat 


385 


Pro 


Tyr 


He 


Leu Val 


Gly Leu Lys Ser Asp 


Leu Arg Asp Glu Phe 


Asp 




110 










115 




120 


125 




cag 


cgc 


gca 


gat 


gaa 


etc cgc get aag 


ggt 


tac gag cca ate cca 


cgc 


433 


Gin 


Arg 


Ala 


Asp 


Glu 


Leu Arg Ala Lys 


Gly 


Tyr Glu Pro He Pro 


Arg 












130 




135 


140 




get 


aag 


ggt 


gag 


gaa 


atg gca aag aag 


ate 


aac gee tgc tec tac 


ata 


481 


Ala 


Lys 


Gly Glu Glu 


Met Ala Lys Lys 


He 


Asn Ala Cys Ser Tyr 


He 










145 




150 




155 






•gaa 


tgc 


tct 


gee 


ctt 


aaa tea tac aac 


etc 


act gaa gtt ttc gat 


gaa 


529 


Glu 


Cys 


Ser 


Ala 


Leu 


Lys Ser Tyr Asn Leu 


Thr Glu Val Phe Asp 


Glu 








160 






165 




170 






gec 


gtc 


aag tac get 


etc gaa cca cca 


get 


cag cag acc cag act 


aaa 


577 


Ala 


Val 


Lys 


Tyr Ala 


Leu Glu Pro Pro 


Ala 


Gin Gin Thr Gin Thr 


Lys 






175 








180 




185 




gaa 


aag 


aca 


ggt 


ggt 


ggc tgc tgc gaa 


ctt 


att taaatttttt gcatatatgg 


630 


Glu 


Lys 


Thr Gly Gly 


Gly Cys Cys Glu 


Leu 


He 






190 










195 




200 







taccagagtt aaaataaaac atgttgaatg ctgtgacaaa caactttata atccgactcg 690 



gtgtaataat aaaagggaat tagg 714 



<210> 200 
<211> 200 
<212> PRT 

<213> Trichomonas vaginalis 



<400> 200 



Met 


Lys 


His 


He 


Lys 


Cys 


Val 


Val 


Val 


Gly 


Asp Gly 


Ala 


Val 


Gly 


Lys 


1 








5 










10 








15 




Thr 


Cys 


Leu 


Leu 


He 


Ser 


Phe 


Thr 


Thr 


Asn 


Ala Phe 


Pro 


Gly 


Glu 


Tyr 








20 










25 








30 




He 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Tyr 


Ser 


Ala 


Asn Val 


Met 


Ala 


Glu 


Gly 






35 










40 








45 






His 


Pro 


Pro 


Val 


Asn 


Leu 


Gin 


Leu 


Trp 


Asp 


Thr Ala 


Gly 


Gin 


Glu 


Asp 




50 










55 








60 








Tyr 


Lys 


Lys 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin Thr 


Asp 


Val 


Phe 


Leu 


65 










70 










75 








80 


Leu 


Cys 


Phe 


Ser 


Leu 


Val 


Tyr 


Pro 


Ala 


Ser 


Leu Glu 


Asn 


He 


Glu 


Thr 










85 








90 








95 




Met 


Trp 


He 


Lys 
100 


Glu 


He 


Lys 


Glu 


Tyr 
105 


Cys 


Pro Asp 


Lys 


Pro 
110 


Tyr 


He 


Leu 


Val 


Gly 


Leu 


Lys 


Ser 


Asp 


Leu 


Arg 


Asp 


Glu Phe 


Asp 


Gin 


Arg 


Ala 






115 










120 








125 






Asp 


Glu 


Leu 


Arg 


Ala 


Lys 


Gly 


Tyr 


Glu 


Pro 


He Pro 


Arg 


Ala 


Lys 


Gly 
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130 135 140 

Glu Glu Met Ala Lys Lys He Asn Ala Cys Ser Tyr He Glu Cys Ser 
145 150 155 160 

Ala Leu Lys Ser Tyr Asn Leu Thr Glu Val Phe Asp Glu Ala Val Lys 

165 170 175 

Tyr Ala Leu Glu Pro Pro Ala Gin Gin Thr Gin Thr Lys Glu Lys Thr. 

180 185 190 

Gly Gly Gly Cys Cys Glu Leu He 
195 200 

<210> 201 
<211> 600 
<212> DNA 

<213> Colletotrichum trifolii 
<220> 

<221> CDS 

" <222> (1) . . (600) 

<400> 201 _ AO 

atg get caa cca gga gta cag teg ctg aag tgt gtg gtg acg ggt gac 48 
Met Ala Gin Pro Gly Val Gin Ser Leu Lys Cys Val Val Thr Gly Asp 
15 10 15 

ggt get gtc gga aag aca tgt ttg ctg att tec tac aca acg aac gee 96 
Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala 

20 25 30 

ttt cct ggc gag tac att cct act gtc ttc gac aac tac teg gcg agt 144 
Phe Pro Gly Glu Tyr He Pro Thr Val Phe Asp Asn Tyr Ser Ala Ser 

35 ^ 40 45 

gta atg gtg gac gga aag cct att age ttg gga ctg tgg gat act gee 192 
Val Met Val Asp Gly Lys Pro He Ser Leu Gly Leu Trp Asp Thr Ala 

50 55 • 60 

ggc cag gaa gat tac gac aga ctg cga ccg ctt tec tac ccc cag acc 
Glv Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr 
65 70 ~ 75 80 

gac gtc ttc ctg att tgc ttc tec ate gtc age ccc cca teg ttt gac 
Asp Val Phe Leu He Cys Phe Ser He Val Ser Pro Pro Ser Phe Asp 

85 90 95 

aac gtc aag gee aag tgg tac ccc gaa ate gat cat cac gee ccc aac 
Asn Val Lys Ala Lys Trp Tyr Pro Glu He Asp. His His Ala Pro Asn 

100 105 HO 

ate ccc att ate etc gtc ggc acc aag ctg gat ttg agg gag gat ccc 
He Pro He He Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Pro 

115 . 120 125 

aac acc etc gag tec etc cgc caa aag egg atg gag ccc gtg teg tac 
Asn Thr Leu Glu Ser Leu Arg Gin Lys Arg Met Glu Pro Val Ser Tyr 

130 135 140 

gat caa gee ctg ate tgc gee aag gaa att aag gca cac aaa tac ctg 480 
Asp Gin Ala Leu He Cys Ala Lys Glu He Lys Ala His Lys Tyr Leu 
145 150 155 160 

gag tgt tct gee ctg aca cag agg aat ctg aag age gtt ttt gac gag 528 
Glu Cys Ser Ala Leu Thr Gin Arg Asn Leu Lys Ser Val Phe Asp Glu 

165 170 175 

gee att cgt get gtc ctg aac ccc agg ccc gtc gcg cag cag aag aag 576 
Ala He Arg Ala Val Leu Asn Pro Arg Pro Val Ala Gin Gin Lys Lys 

180 185 190 

aag teg aag tgt acg att ttg tga 
Lys Ser Lys Cys Thr He Leu 
195 



240 
288 
336 
384 
432 



600 



<210> 202 
<211> 199 
<212> PRT 

<213> Colletotrichum trifolii 
<400> 202 

Met Ala Gin Pro Gly Val Gin Ser Leu Lys Cys Val Val Thr Gly Asp 
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1 








5 




Gly 


Ala 


Val 


Gly 
20 


Lys 


Thr 


Phe 


Pro 


Gly 
35 


Glu 


Tyr 


He 


Val 


Met 
50 


Val 


Asp 


Gly 


Lys 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asp 


65 










70 


Asp 


Val 


Phe 


Leu 


He 
85 


Cys 


Asn 


Val 


Lys 


Ala 
100 


Lys 


Trp 


lie 


Pro 


He 
115 


He 


Leu 


Val 


Asn 


Thr 
130 


Leu 


Glu 


Ser 


Leu 


Asp 


Gin 


Ala" 


Leu 


lie 


Cys 


145 










150 


Glu 


Cys 


Ser 


Ala 


Leu 


Thr 








165 




Ala 


He 


Arg 


Ala 
180 


Val 


Leu 


Lys 


Ser 


Lys 
195 


Cys 


Thr 


He 



<210> 203 
<211> 618 
<212> DNA 

<213> Brachydanio rerio 

<220> 

<221> CDS 

<222> (1) . . (618) 



<400> 203 



atg 


gca 


aac 


gga acc 


ggc 


Met 


Ala 


Asn 


Gly Thr 


Gly 


1 






5 




gac 


ggc 


gcc 


gtt gga 


aaa 


Asp 


Gly 


Ala 


Val Gly 


Lys 








20 




gcc 


ttt 


cca 


gag gaa 


tat 


Ala 


Phe 


Pro 


Glu Glu 


Tyr 






35 






agt 


gtg 


acg 


gtc ggg 


ggg 


Ser 


Val 


Thr 


Val Gly 


Gly 




50 








gcg 


ggt 


cag 


gag gac 


tat 


Ala 


Gly 


Gin 


Glu Asp 


Tyr 


65 








70 


acg 


gat 


gtc 


ttc etc 


att 


Thr 


Asp 


Val 


Phe Leu 


He 






85 




cag 


aat 


gta 


aga gag 


gaa 


Gin 


Asn 


Val 


Arg Glu 


Glu 








100 




aac 


ate 


cct 


tac ctg 


etc 


Asn 


He 


Pro 


Tyr Leu 


Leu 






115 






ccc 


aag 


act 


ate gcc 


aag 


Pro 


Lys 


Thr 


He Ala 


Lys 




130 








acg 


gaa 


caa 


ggc caa 


aag 


Thr 


Glu 


Gin 


Gly Gin 


Lys 


145 








150 


gtt 


gaa 


tgc 


tea gca 


eta 


Val 


Glu 


Cys 


Ser Ala 


Leu 



157/291 
10 



Cys 


Leu 


Leu 


He 


Ser 


Tyr 






25 








Pro 


Thr 


Val 


Phe 


Asp 


Asn 




40 










Pro 


He 


Ser Leu 


Gly Leu 


55 










.60 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 










75 




Phe 


Ser 


He 


Val' 


Ser 


Pro 








90 






Tyr 


Pro 


Glu 


He 


Asp 


His 






105 








Gly 


Thr 


Lys 


Leu 


Asp Leu 




120 










Arg 


Gin 


Lys Arg 


Met 


Glu 


135 










140 


Ala 


Lys 


Glu 


He 


Lys 


Ala 










155 




Gin 


Arg 


Asn 


Leu 


Lys 


Ser 








170 






Asn 


Pro 


Arg 


Pro 


Val 


Ala 






185 








Leu 












age 


ate 


atg 


ttg 


aaa 


tgc 


Ser 


He 


Met 


Leu 


Lys 


Cys 








10 






act 


tgt 


ctg ttg 


atg 


age 


Thr 


Cys 


Leu 


Leu 


Met 


Ser 






25 








gtg 


ccc 


acc 


gtc 


ttt 


gat 


Val 


Pro 


Thr 


Val 


Phe Asp 




40 










aaa 


cag 


tac 


ctg 


ctg ggg 


Lys 


Gin 


Tyr Leu 


Leu Gly 


55 










60 


gat 


cgc 


ctg 


cga 


ccc 


ctg 


Asp 


Arg 


Leu Arg 


Pro 


Leu 










75 




tgc 


ttc 


tec 


gtg 


gtc 


aat 


Cys 


Phe 


Ser 


Val 


val 


Asn 








90 






tgg 


gtc 


cct 


gag 


ctg 


cag 


Trp 


Val 


Pro 


Glu 


Leu 


Gin 






105 








att 


ggc 


act 


cag 


att 


gac 


He 


Gly Thr Gin 


He 


Asp 




120 










etc 


aac 


gac 


gtg 


aag 


gag 


Leu 


Asn Asp Val 


Lys 


Glu 


135: 










140 


eta 


gca 


aag 


gag 


att 


ggt 


Leu 


Ala 


Lys 


Glu 


He Gly 










155 




aca 


cag 


aag 


ggc 


ctg 


aag 


Thr 


Gin Lys 


Gly 


Leu Lys 
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15 




Thr 


Thr 


Asn 


Ala 




30 






Tyr 


Ser 


Ala 


Ser 


45 








Trp 


Asp 


Thr 


Ala 


Tyr 


Pro 


Gin 


Thr 








80 


Pro 


Ser 


Phe 


Asp 






95 




His 


Ala 


Pro 


Asn 




110 






Arg- 


GlU 


Asp 


Pro 


125 








Pro 


val 


Ser 


Tyr 


His 


Lys 


Tyr 


Leu 








160 


Val 


Phe 


Asp 


Glu 






175 




Gin 


Gin 


Lys 


Lys 



190 



gtg 


gtc 


gtc 


ggt 


48 


Val 


Val 


Val 


Gly 








15 






tat 


gcc 


aac 


gac 


96 


Tyr 


Ala 


Asn 


Asp 






30 








cac 


tat 


gca 


gtg 


144 


His 


Tyr 


Ala 


val 




45 










etc 


tat 


gac 


act 


192 


Leu 


Tyr 


Asp 


Thr 




tec 


tat 


cct 


atg 


240 


Ser 


Tyr 


Pro 


Met 










80 




ccg 


gcc 


age 


ttc 


288 


Pro 


Ala 


Ser 


Phe 








95 






gaa 


tat 


gca 


cca 


336 


Glu 


Tyr 


Ala 


Pro 






110 








ctg 


egg 


gat 


gac 


384 


Leu 


Arg 


Asp 


Asp 




125 










aag 


ccc 


att 


gtg 


432 


Lys 


Pro 


He 


Val 




gcc 


tgc 


tgc 


tat 


480 


Ala 


Cys 


Cys 


Tyr 










160 




act 


gtg 


ttt 


gat 


528 


Thr 


val 


Phe 


Asp 





WO 2005/014828 

165 

gag gcc att ata gcc att 
Glu Ala He He Ala He 
180 

aga eta ggc ccg cgc tgc 
Arg Leu Gly Pro Arg Cys 
195 



158/291 

170 

ctg get cct aag aaa ggc 
Leu Ala Pro Lys Lys Gly 
185 

att aac tgc tgc etc ate 
He Asn Cys Cys Leu He 

200 . 



PCT/EP2004/008136 
175 

gca ctg aag cga 576 
Ala Leu Lys Arg 
190 

aca tga 618 

Thr 

205 



<210> 204 
<211> 205 
<212> PRT 

<213> Brachyclanio rerio 
<400> 204 

Met Ala Asn Gly Thr Gly Ser He Met Leu Lys Cys Val Val Val Gly 
1 5 10 15 . 

Asp Gly Ala Val Gly Lys Thr Cys Leu Leu Met Ser Tyr Ala Asn Asp 

20 25 30 

Ala Phe Pro Glu Glu Tyr Val Pro Thr Val Phe Asp His Tyr Ala Val 

35 40 45 

Ser Val Thr Val Gly Gly Lys Gin Tyr Leu Leu Gly Leu Tyr Asp Thr 

50 55 60 

Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Met 
65 *" 70 75 80 

Thr Asp Val Phe Leu He Cys Phe Ser Val Val Asn Pro Ala Ser Phe 

85 90 95 

Gin Asn Val Arg Glu Glu Trp Val Pro Glu Leu Gin Glu Tyr Ala Pro 

100 105 HO 

Asn He Pro Tyr Leu Leu lie Gly Thr Gin He Asp Leu Arg Asp Asp 

115 120 125 

Pro Lys Thr He Ala Lys Leu Asn Asp Val Lys Glu Lys Pro He Val 

130 135 140 

Thr Glu Gin Gly Gin Lys Leu Ala Lys Glu He Gly Ala Cys Cys Tyr 
145 150 155 160 

Val Glu Cys Ser Ala Leu Thr Gin Lys Gly Leu Lys Thr Val Phe Asp 

165 170 175 

Glu Ala He He Ala He Leu Ala Pro Lys Lys Gly Ala Leu Lys Arg 

180 185 190 

Arq Leu Gly Pro Arg Cys He Asn Cys Cys Leu He Thr 
195 200 205 

<210> 205 
<211> 582 
<212> DNA 

<213> Xenopus laevis 

<220> 

<221> CDS 

<222> (1)..(582) 

atg°gca°|cg att egg aaa aag ctg gtg att gtt gga gat ggt gcc tgt 48 
Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 
1 5 io 15 

qqq aag ace tgc ctg etc att gta ttc age aag gat cag ttc cct gag 
Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtt tat gtc cca act gtc ttt gag aac tac att gcc gat ata gaa gtc 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr He Ala Asp He Glu Val 

35 40 45 

gat gga aaa cag gtg gaa ctg get ctg tgg gac aca gca gga cag gag 
Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tat gac agg eta cgc cca ctt tct tac cct gat act gat gtg att 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

ttg atg tgt ttc tea att gac age cct gac agt tta gag aat att cct 



96 



144 



192 



240 



288 
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Leu 


Met 


Cvs 


Phe 


Ser 


He Ast5 

■J* nwLy 


Ser 


Pro 




Ser 


Leu Gin Ann Tie Pt*o 












85 








90 








aacr 


aaa 


tcra 


act 


cca 


craa crtcr 


aag 


cat 


ttc 


tgt 


ccc aat erf- a nrr att 


o o o 


Glu 


Lvs 


Tttd 


Thr 


Pro 


Glu Val 


Lys 


His 




Cys 


Pro Aqti Va 1 *Pt"o Tie 
nou vclx zr*.\J) ixc 










100 








105 






1JLU 




att 


ctt 


crtc 


crcrt 


aac 


aaa aac7 


gat 


eta 


aga 


aat 


crat craa <^a<** aft a era 




lie 


Leu 


Val 


Glv 


As ii 


Lys Lys 


Asp Leu 


A T*fT 

^9 


Asn 


no^j vjiu nxo x xix rtxy 








115 








120 












aga 


oracr 


ctg 


aca 


aag 


atcr aacr 


cag 


gag 


t ^_»CL 


gtt 


aaa ppa rrasa rrat nnr* 

0.0.0. Ct yCLCL y CL U (j^C 






Glu 


Leu 


Thr 


Lys 


Mph TiVB 

1 Km J— 1 jr 0 


Gin 


Glu 


Pro 

xr.X LJ 


Val 

V CLJL 


xjjro r*XO olu HSp oiy 






130 


















1 4.0 




aga 


gaa 


atg 


y 


aac 


a era atr* 


aat 


gca 


L. L- L. 


yy<- 


tar* ohrr rraa 4- rr+" 

l-cll* ctg yaa ty t tut 


/Oft 


Ar-cr 


Glu 


Met 


Ala 


Asn 


At~ct Tie* 


Asn 


Ala 


It 11C 


Gly 


j-yj. ucu VzlU uyo OCX 




1 


















IDS 


160 




QfCt 


aag 


aca 


aag 


yat 


ggc gtg 


aga 


gaa 


gtt 




gaa atg get aca egg 


528 


ill a 

a j. a 


Lys 


JlxLx 


Lys 


Asp 


<jxy vai 


Arg Glu 


vai 


Pne 


Glu Met Ala Thr Arg 












165 








170 




175 




get 


gec 


ttg 


caa 


gtt 


agg aaa 


cac 


aga 


aag 


aag" 


caa cgt tgt tct att 


576 


Ala 


Ala 


Leu 


Gin 


Val 


Arg Lys 


His Arg 


Lys 


Lys 


Gin Arg Cys Ser lie 










180 








185 






190 




eta 


taa 




















582 


Leu 

























<210> 206 
<211> 193 
<212> PRT 

<213> Xenopus laevis 
<400> 206 

Met Ala Ala lie Arg Lys Lys Leu Val lie Val Gly Asp Gly Ala Cys 

1 5 10 15 

Gly Lys Thr Cys Leu Leu lie Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 ' . . 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr lie Ala Asp lie Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val lie 
65 70 75 80 

Leu Met Cys Phe Ser lie Asp Ser Pro Asp Ser Leu Glu Asn lie Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro lie 

100 105 110 

lie Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 120 125 

Arg Glu Leu Thr Lys Met Lys Gin Glu Pro Val Lys Pro Glu Asp Gly 

130 135 140 

Arg Glu Met Ala Asn Arg lie Asn Ala Phe Gly Tyr Leu Glu Cys Ser 
145 150 155 " 160 

Ala Lys Thr Lys Asp Gly Val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Val Arg Lys His Arg Lys Lys Gin Arg Cys Ser lie 
180 185 190 

Leu 



<210> 207 
<211> 1092 
<212> DNA 

<213> Xenopus laevis 

<220> 

<221> CDS 

<222> (25).. (606) 



<400> 207 



75 80 85 

aac agt ttg gaa aat ate cct gag aaa tgg acc cce 

Asp Ser Leu Glu Asn lie Pro Glu Lys Trp Thr Pro Glu Val Lys Hxs 
90 95 100 

ttc tac ccc aat gtg cca ata ate ctg gtg gga aac aag aag gat ctg 387 
Phe Cvs Pro Asn Val Pro lie He Leu Val Gly Asn Lys Lys Asp Leu 

110 115 12° 

egg aat gac gag cac acg cgc egg gag ttg gee aaa atg aag cag gag 
A?f Asn Asp Glu His Thr Arg Arg Glu Leu Ala Lys Met Lys Gin Glu 

3 125 130 135 

cca gta aga gca gag gag gga agg gac atg gee aat cga ate age gee 
Pro Val Arg Ala Glu Glu Gly Arg Asp Met Ala Asn Arg He Ser Ala 

140 145 150 

ttt acre tac ttg gag tgc tct gec aag acc aag gat ggc gta aga gaa 53 1 

Phe Glv Tyr Leu Glu Cys Ser Ala Lys Thr Lys Asp Gly Val Arg Glu 

155 16° 165 „„ 

gtg ttt gag atg gcg act egg gcg get ctg cag gee aaa cgt ggg cgc 579 
Val Phe Glu Met Ala Thr Arg Ala Ala Leu Gin Ala Lys Arg Gly Arg 
170 175 I 80 

aag agg age acc tgc cag tta ctg tgaggagaga agececatge cctcattctc 633 
Lys Arg Ser Thr Cys Gin Leu Leu 
190 

cccettccct gtgeectetc tctccgctgt gggctagggg agetgeggae tgaggagacg 



caccaataae caatccettt cetgttctca tgggcagege caggccccga gtccagcaaa 
tcacatcagt gecaggtctg tagcaacagg gggtttctta caaeagtgee attgtetagt 
gcagggcccc ttcctctaaa aggegggggg gggattattt ggctttagaa ggccccgctg 
tcctgggggt eggattgtae acacgagagc gggacccctc gtgatattgg ttaattccac 
cccgaccctg tgctggcgag ggttaatcct gcagctcatg agaggggtgg ggaggtttta 
taataatgtt gtccctgctt gaatgacatt ttattgtcca ttgtactcga gtatcaacca 
ataaactttg tagccatgtt tagtaaaaaa aaaaaaaaa 



51 
99 
147 



WO 2005/014828 PCT/EP2004/008136 

160/291 

ggagggagct gcgagtagga aggc atg gca gec ata agg aag aag ctg gtg . 
ggaggy a a s = . Met Ala Ala He Arg Lys Lys Leu Val 

1 5 
att ata age gac ggg gcg tgt ggc aag act tgt etc etc ate gtc ttt 
He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe 

10 15 20 25 

age aaa gac cag ttc cca gag gtc tat gtt ccc ace gtc ttt gag aac 
Ser Lys Asp Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe Glu Asn 

30 35 40 

tac atg gec gac ate gag gtg gac tec aag' cag gta gaa ctg get ctg 195 
Tyr Val Ala Asp He Glu Val Asp Ser Lys Gin Val Glu Leu Ala Leu 

45 50 55 

tag gac acg get ggg caa gag gat tac gac cga ctg aga ccc etc tec 243 
Tro Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser 

60 65 70 

tac cct gac aca gac gtc ate etc atg tgt ttc tec ate gac age ccc 291 
Tyr Pro Asp Thr Asp Val He Leu Met Cys Phe Ser lie Asp Ser Pro 



gac agt ttg gaa aat ate cct gag aaa tgg ace cca gaa gtg aaa cat 339 



435 
483 



693 
753 



813 

873 

933 

993 

1053 

1092 



<210> 208 
<211> 193 
<212> PRT 



WO 2005/014828 

<213> Xenopus laevis 
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<400> 208 



Met 


Ala 


Ala 


lie 


Arg 


Lys 


Lys 


Leu 


Val 


lie Val Gly Asp Gly Ala Cys 




1 








5 










10 15 




Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


lie 


Val 


Phe 


Ser Lys Asp Gin Phe Pro Glu 










20 










25 


30 




Val 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Glu Asn 


Tyr Val Ala Asp lie Glu Val 








35 










40 




45 




Asp 


Ser 


Lys 


Gin 


Val 


Glu 


Leu Ala 


Leu 


Trp Asp Thr Ala Gly Gin Glu 






50 










55 






60 




Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr Pro Asp Thr Asp Val lie 




65 










70 








75 80 




Leu 


Met 


Cys 


Phe 


Ser 


lie Asp Ser Pro 


Asp Ser Leu Glu Asn lie Pro 












85 










90 95 




Glu 


Lys 


Trp 


Thr 


Pro 


Glu 


Val 


Lys 


His 


Phe Cys Pro Asn Val Pro He 










100 










105 


110 




lie 


Leu "Val 


Gly 


Ash 


Lys Lys Asp Leu 


Arg Asn Asp Glu* His Thr Arg 








115 










120 




125 




Arg 


Glu 


Leu 


Ala 


Lys 


Met 


Lys 


Gin 


Glu 


Pro Val Arg Ala Glu Glu Gly 






130 










135 






140 




Arg 


Asp 


Met 


Ala 


Asn 


Arg 


lie 


Ser 


Ala 


Phe Gly Tyr Leu Glu Cys Ser 




145 










150 








155 160 




Ala 


Lys 


Thr. 


Lys 


Asp 


Gly Val Arg Glu 


Val Phe Glu Met Ala Thr Arg 












165 










170 175 




Ala 


Ala 


Leu 


Gin 


Ala 


Lys 


Arg Gly Arg 


Lys Arg Ser Thr Cys Gin Leu 










180 










185 


190 




Leu 






















<210> 209 


















<211> 582 












• 






<212> DNA 


















<213> Brachydanio rerio 












<220> 




















<221> CDS 


















<222> (1) . . i 


£582) 














<400> 209 


















atg 


gca 


gca 


att 


cgc 


aag 


aag 


ctg 


gtg 


att gta gga gat gga gca tgc 


48 


Met 


Ala 


Ala 


lie Arg 


Lys 


Lys 


Leu 


Val 


He Val Gly Asp Gly Ala Cys 




1 








5 










10 15 




ggg 


aaa 


aca 


tgt* ttg 


ctt 


att 


gtg 


ttc 


agt aaa gac cag ttt cct gaa 


96 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


lie 


Val 


Phe 


Ser Lys Asp Gin Phe Pro Glu 










20 










25 


30 




gtc 


tac 


gta 


cca 


aca 


gtg 


ttt 


gaa 


aac 


tat gtt gec gac att gaa gtt 


144 


Val 


Tyr 


Val 


Pro 


Thr 


val 


Phe 


Glu 


Asn 


Tyr Val Ala Asp He Glu Val 








35 








> 


40 




45 




gac 


age 


aaa 


cag 


gtg 


gaa 


etc 


get 


ctg 


tgg gat act gca ggg cag gag 


192 


Asp 


Ser 


Lys 


Gin 


Val 


Glu 


Leu 


Ala 


Leu 


Trp Asp Thr Ala Gly Gin Glu 






50 










55 






60 




gac 


tac 


gac 


agg 


ctt 


cgt 


cct 


ctt 


tea 


tac cca gac acc gat gtc att 


240 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr Pro Asp Thr Asp Val He 




65 










70 








75 80 




ctt 


atg 


tgt 


ttc 


tct 


att 


gac 


agt 


cct 


gac agt tta gag aat att ccg 


288 


Leu 


Met 


Cys 


Phe 


Ser 


lie 


Asp 


Ser 


Pro 


Asp Ser Leu Glu Asn lie Pro 












85 










90 95 




gag 


aag 


tgg 


acg 


cct 


gag 


gtc 


aaa 


cac 


ttt tgt cca aac get cca ate 


336 


Glu 


Lys 


Trp 


Thr 


Pro 


Glu 


Val 


Lys 


His 


Phe Cys Pro Asn Ala- Pro He 










100 










105 


110 




ate 


ctt 


gtg 


gga 


aac 


aaa 


aaa 


gac 


ctg 


cgc aat gat gaa cac aca cgc 


384 


lie 


Leu 


Val 


Gly Asn 


Lys 


Lys 


Asp 


Leu 


Arg Asn Asp Glu His Thr Arg 








115 










120 




125 




agg 


gaa 


ctg 


acc 


aag 


atg 


aag 


cag 


gag 


cca gtt aaa gca gag gaa ggg 


432 


Arg 


Glu 


Leu 


Thr Lys 


Met 


Lys 


Gin 


Glu 


Pro Val Lys Ala Glu Glu Gly 






130 










135 






140 




aga 


gac 


atg 


gec 


aat 


cga 


att 


gga 


gca 


ttt ggt tat atg gaa tgt tct 


480 
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Arg Asp Met Ala Asn Arg He Gly Ala Phe Gly Tyr Met Glu Cys Ser 
145 150 155 160 

gcc aaa aca aaa gac ggt gtt egg gag gtg ttt gaa atg gcc act aga 
Ala Lys Thr Lys Asp Gly Val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 170 175 

o C g gcg ctg cag gca cgc agg gga aag aag age aac aaa tgc tgt ctg 
HI IS Leu Gin Ala Arg Arg Gly Lys Lys Ser Asn Lys Cys Cys Leu 
180 185 19° 

ctg tga 
Leu 



<210> 210 
<211> 193 
<212> PRT 

<213> Brachydaoio rerio . . . 

Met°Ala 1 Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 
1 5 1° 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 3 0 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Ser Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly din Glu 

50 55 6° 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val lie 
65 ~ 70 75 80 

Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn lie Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Ala Pro He 

100 105 H° 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 120 I 25 

Arcr Glu Leu Thr Lys Met Lys Gin Glu Pro Val Lys Ala Glu Glu Gly 

130 135 140 

Arg Asp Met Ala Asn Arg He Gly Ala Phe Gly Tyr Met Glu Cys Ser 
145 150 155 160 

Ala Lys Thr. Lys Asp Gly val Arg Glu Val Phe Glu Met Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Ala Arg Arg Gly Lys Lys Ser Asn Lys Cys Cys Leu 
180 185 19° 

Leu 



528 
576 
582 



<210> 211 
<211> 576 
<212> DNA 

<213> Brachydanio rerio 

<220> 

<221> CDS 

<222> (1) . . (576) 

aJg^ag^gc a tc aag tgt gtg gtg gtg gga gac gga gcg gtg gga aaa 48 
Met Gin Ser He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 * 

acc tgc ctg etc atc tec tac acg acc ggc gcc ttc ccc aaa gag tac 96 
Thr C?s Leu Leu He Ser Tyr Thr Thr Gly Ala Phe Pro Lys Glu Tyr 

20 25 30 

att ccc acc gtc ttc gac aac tac age tec cag gtc age gtg gac aac 
He Pro Thr Val Phe Asp Asn Tyr Ser Ser Gin Val Ser Val Asp Asn 

~ 35 40 4 5 

cgt acc gtc age etc aat ctg tgg gac acc gcg ggt cag gag gag tac 
Arg Thr Val Ser Leu Asn Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr 
50 55 60 



144 



192 
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gac 


cgc 


ctt 


cgc 


acg 


ctg 


Asp 


Arg 


Leu 


Arg 


Tnr 


Leu 


65 










70 


tgc 


ttc 


tec 


ate 


tec 


age 


Cys 


Phe 


Ser 


He 


Ser 


Ser 










85 


■■ . 


tgg 


cac 


ccg 


gag 


gtg 


aca 


Trp 


His 


Pro 


GlU 


Val 


Tnr 








100 






gtg 


gga 


acc 


aag 


agt 


gat 


Val 


Gly 


Thr 


Lys 


Ser 


Asp 






115 








ctg 


aag 


gag 


cag 


aac 


cag 


Leu 


Lys 


Glu 


Gin 


Asn 


Gin 




130 










ctg 


get 


cgc 


caa 


ate 


cac 


Leu 


Ala 


Arg 


Gin 


He 


His 


"145" 








150 


agt 


cag 


gac 


ggc 


ate 


aag 


Ser 


Gin 


Asp 


Gly 


He 


Lys 










165 




ctt 


agt 


cct 


caa 


ccc 


gtg 


Leu 


Ser 


Pro 


Gin 


Pro 


val 








180 






tga 
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tec 


tat 


cct 


caa 


acc 


aac 


Ser 


Tyr 


Pro 


Gin 


Thr 


Asn 










75 




cct 


ccg 


tec 


tac 


gag 


aac 


jb*ro 


Pro 


Ser 


Tyr 


Glu Asn 








90 






cac 


cac 


tgt 


cct 


age 


.gtg 


HIS 


His 


Cys 


Pro 


Ser 


Val 






105 








etc 


cgc 


aat 


gac' 


gec 


gac 


Leu 


Arg 


Asn 


Asp 


Ala 


Asp 




120 










gcg 


ccc 


ate 


acg 


acc 


cag 


TV T -n 

Axa 


Pro 


lie 


Thr 


Thr 


Gin 


lie 










140 


gee 


gtc 


aag 


tac 


cgt 


gag 


Ala 


Val 


Lys 


Tyr 


Arg Glu 










*i55 




gac 


gtg 


ttt 


gca 


gac 


gca 


Asp 


Val 


Phe 


Ala 


Asp Ala 








170 






get 


aac 


aag 


aag 


ccc 


tgt 


Ala 


Asn 


Lys 


Lys 


Pro 


Cys 






185 
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gtc 


ttc 


ate 


ate 


240 


Val 


Phe 


He 


He 










80 




ate 


aag 


cat 


aag 


288 


lie 


Lys 


His 


Lys 








95 






ccc 


ate 


ctg 


ctt 


336 


Pro 


He 


Leu 


Leu 






110 

J_ -L \J 








gtg 


ctg 


aag 


aag 


384 


Val 


Leu 


Lys 


Lys 




125 










cag 


ggt 


caa 


gcg 


432 


Gin 


Gly 


Gin 


Ala 




tgt 


teg 


gca 


etc 


480 


Cys 


Ser 


Ala 


Leu 










16b 




gtg 


cgt 


gee 


tac 


528 


Val 


Arg 


Ala 


Tyr 








175 






ata 


etc 


etc 




573 


He 


Leu 


Leu 








190 









576 



<210> 212 
<211> 191 
<212> PRT ; 

<213> Brachydanio rerio 
<400> 212 



Met 


Gin 


Ser 


He 


Lys 


Cys 


Val 


Val 


1 








5 








Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 








20 










He 


Pro 


Thr 


val 


Phe 


Asp 


Asn 


Tyr 






35 










40 


Arg 


Thr 


Val 


Ser 


Leu 


Asn 


Leu 


Trp 




50 










55 




Asp 


Arg 


Leu 


Arg 


Thr 


Leu 


Ser 


Tyr 


65 










70 






Cys 


Phe 


Ser 


He 


Ser 


Ser 


Pro 


Pro 










85 








Trp 


His 


Pro 


Glu 


Val 


Thr 


His 


His 








100 










Val 


Gly 


Thr 


Lys 


Ser 


Asp 


Leu 


Arg 






115 










120 


Leu 


Lys 


Glu 


Gin 


Asn 


Gin 


Ala 


Pro 




130 










135 




Leu 


Ala 


Arg 


Gin 


He 


His 


Ala 


Val 


145 










150 






Ser 


Gin 


Asp 


Gly 


He 


Lys 


Asp 


Val 










165 








Leu 


Ser 


Pro 


Gin 


Pro 


Val 


Ala 


Asn 



180 

<210> 213 
<211> 576 
<212> DNA 

<213> Brachydanio rerio 



Val 


Gly 


Asp Gly 


Ala 


Val 


Gly 


Lys 




10 










15 




Thr 


Gly 


Ala 


Phe 


Pro 


Lys 


Glu 


Tyr 


25 










30 






Ser 


Ser 


Gin 


val 


Ser 


Val 


Asp 


Asn 










45 






Asp 


Thr 


Ala 


Gly 
60 


Gin 


Glu 


Glu 


Tyr 


Pro 


Gin 


Thr 
75 


Asn 


Val 


Phe 


He 


He 
80 


Ser 


Tyr 
90 


Glu 


Asn 


He 


Lys 


His 
95 


Lys 


Cys 


Pro 


Ser 


Val 


Pro 


He 


Leu 


Leu 


105 










110 






Asn 


Asp 


Ala Asp 


Val 


Leu 


Lys 


Lys 










125 








He 


Thr 


Thr 


Gin 
140 


Gin 


Gly 


Gin 


Ala 


Lys 


Tyr 


Arg 
155 


Glu 


Cys 


Ser 


Ala 


Leu 
160 


Phe 


Ala 
170 


Asp 


Ala 


Val 


Arg 


Ala 
175 


Tyr 


Lys 


Lys 


Pro 


Cys 


He 


Leu 


Leu 





185 190 



<220> 

<221> CDS 

<222> (1) . . (576) 
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asks a a a « e e ss a a a a sp si 18 

c 10 
1 j nra erae aaa tac 96 



1 ~ «-~«- oi-rr 1-fcr ate tec tac acc acc aat gec ttc cca gac gag tac 
S Cys Leu Leu Se Ser Tyr Thr Thr Asn Ala. Phe Pro Asp Glu Tyr 

20 ^ ^ i a a 

S S S fa! S2 a a SS K S- a S $ S3 5 SS 

s a a is a a a s a s s ss a a a a 152 

oac cec eta cga acc etc tec tac 003 oag aca cac gu ttc ate ate 240 
IS I?g Sg Tor Leu ser Tyr Pro Gin Thr His VI Phe II. lie 

| Se III 53 S SS Pro S 2 Sa Sa So SS S S # 

s si a ss a a - k s if? ss ss a a 336 

100 105 ,Q/ 

fai is s a a a a a ss a ss ss s a is a 

tic 120 JL^w 

a a ss ss if? s a a s a s a a ss a s 432 

n 135 

a s a a a a a a a a a ss a a s a «• 
s a ss ss ss a a a a a ss is a a a a 528 

165 170 .-.a 

r-t-n ra = ccc aac crcc aag aag cac acc aaa aaa tgt gtg ctg ctt 
LeS Tyr Pro Asn fit Lys Ly! His Thr Lys Lys Cys Val Leu Leu 

180 185 576 

taa 



Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Asp Glu Tyr 
He Pro Thr Val Phe Asp Asn Tyr Ser Thr Gin Thr Cys Val Asp Gly 



<210> 214 
<211> 191 
<212> PRT 

<213> Brachydanio rerio 

Me^Gl^Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
x 5 10 

Leu 
Thi 

Arg Ala val Ser Leu Asn Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr 

cn 55 
Asp Arg Leu Arg Thr Leu Ser Tyr Pro Gin Thr His Val Phe He He 

%s Phe Ser Val Ala Ser Pro Ser Ser His Ala Asn Val Arg His Lys 

85 90 
Trp His Pro Glu Val Cys His His Cys Pro Gly Val Pro Val Leu Leu 

100 105 

Val Gly Thr Lys Arg Asp Leu Arg Gly Asp Lys Glu Thr Leu Glu Lys 

Tic 120 XZa 

Leu Lys Glu Gin Gly Met Ser Pro Thr Thr Pro Gin Gin Gly Ser Ala 
Leu A^a Arg Ser He Gly Ma Val Arg Tyr Leu Glu Cys Ser Ala Leu 
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145 150 155 160 

Leu Gin Glu Gly Val Arg Glu Val Phe Asn Glu Ala Val Arg Ala Val 

165 170 175 

Leu Tyr Pro Asn Ala Lys Lys His Thr Lys Lys Cys Val Leu Leu 
180 185 190 

<210> 215 
<211> 1124 
<212> DNA 

<213> Rattus norvegicus 

<220> 

<221> CDS 

<222> (100) . . (684) 

<400> 215 

aggtggtggc tcaggatagg tgtcctctgc actaataggc ctacctgggt gtagtctgac 60 

agcctgtggt ctgagaagtc ccaggtgctt tgtacagca atg get gec ate egg 114 

Met Ala Ala lie Arg 
1 5 

aag aaa ctg gtg att gtg gga gat gga get tgt gga aaa aca tgc ttg 162 
Lys Lys Leu Val lie Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu 

10 15 20 

etc ate gtc ttc age aag gac cag ttt cct gat gtt tat gtg ccc aca 210 
Leu He Val Phe Ser Lys Asp Gin Phe Pro Asp Val Tyr Val Pro Thr 

25 30 35 

gtg ttt gag aac tat gtg get gat ate gaa gtg gat gga aaa cag gtg 258 
Val Phe Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys Gin Val 

40 45 50 

gag ttg gee etc tgg gac aca get ggt caa gaa gat tat gat cgc ttg 306 
Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu 

55 60 65 

agg cca etc tec tat ccg gac act gat gtc etc ttg ata tgt ttc tec 354 
Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val Leu Leu He Cys Phe Ser 
70 75 80 85 

att ggc aac cct gat age ttt ggg aac ate cca cac aaa tgg ata cca 402 
lie Gly Asn Pro Asp Ser Phe Gly Asn He Pro His Lys Trp He Pro 

.90 95 100 

gaa gta aag cat ttc tgt ccc aac gtg ccc ate ate ctg gtc ggg age 450 
Glu Val Lys His Phe Cys Pro Asn Val Pro He He Leu Val Gly Ser 

105 110 115 

aag aag gat ctt egg aat gac ttg aac aca ata caa gag tta gee aag 498 
Lys Lys Asp Leu Arg Asn Asp Leu Ash Thr He Gin Glu Leu Ala Lys 

120 125 130 

agg aaa caa gag ccg gtg aaa cct gaa caa ggc cga gat ttg gcg aac 546 
Arg Lys Gin Glu Pro Val Lys Pro Glu Gin Gly Arg Asp Leu Ala Asn 

135 140 145 

age att ggc get ttc gag tat gtg gag tgt tct gca aag ace aaa gat 594 
Ser He Gly Ala Phe Glu Tyr Val Glu Cys Ser Ala Lys Thr Lys Asp 
150 155 160 165 

gga gtg aga aag gtc ttt gaa aaa age cac caa ggc ttg etc ttg caa 642 
Gly Val Arg Lys Val Phe Glu Lys Ser His Gin Gly Leu Leu. Leu Gin 

170 175 180 

acg aac cgt gtg aag aaa aag act ggc tgc ttt gtc ttt tgaagtcttt 691 
Thr Asn Arg Val Lys Lys Lys Thr Gly Cys Phe Val Phe 

185 190 
gtgcaaacac agccctagta cagttaattt tcaagtgcta tttacgaatg ttattgtgta 751 

attgetagee tttttgttta. tttatatatc caagattaca- ggtcagaagt catcttgeta 811 

ctaatgtttg gaaatcaact gtgattatta atgacattca ttctgtctga ccttccaggg 871 



tcctgactct tctgtaacag accattctgc atttcacctg aaacccaagg cactaattga 



931 
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1124 



agaacttatt atctgtagaa agagagatgg taacttcatc atttagactg taactacttt 991 

ctgaccaccc taccatccat atatgagctg agaagaaagc tgttgagtca ccacttgagt 1051 

gtttgctccc taacaacttt aatcaccatc cttattctct tcgaaatgtc tctccaggca 1111 
gctgtggtag agt 



<210> 216 ... 

" <2Tl> 194 
<212> PRT 

<213> Rattus norvegicus 

<400> 216 M „ ^ 

Met Ala Ala lie Arg Lys Lys Leu Val lie Val Gly Asp Gly Ala Cys 
15 io 15 

Gly Lys Thr Cys Leu Leu lie Val Phe Ser Lys Asp Gin Phe Pro Asp 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val Leu 
65 70 ■. 75 80 

Leu He Cys Phe Ser He Gly Asn Pro Asp Ser Phe Gly Asn He Pro 

85 90 95 

His Lys Trp He Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 HO 

He Leu Val Gly Ser Lys Lys Asp Leu Arg Asn Asp Leu Asn Thr He 

115 120 125 

Gin Glu Leu Ala Lys Arg Lys Gin Glu Pro Val Lys Pro Glu Gin Gly 

130 135 140 

Arg Asp Leu Ala Asn Ser He Gly Ala Phe Glu Tyr Val Glu Cys Ser 
145 150 155 160 

Ala Lys Thr Lys Asp Gly Val Arg Lys Val Phe Glu Lys Ser Hxs Gin 

165 170 175 

Gly Leu Leu Leu Gin Thr Asn Arg Val Lys Lys Lys Thr Gly Cys Phe 
180 185 190 

Val Phe • 



<210> 217 
<211> 576 
<212> DNA 

<213> Aplysia califomica 

<220> 
<221> CDS 
<222> (1)..(576) 

<400> 217 AQ 
atg caa acc ata aag tgt gtg gtt gtg ggg gat ggg gca gtc gga aaa 48 
Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
1 5 10 15 

acc tgc ctt etc att tea tac acc aca aat aaa ttt cct tea gaa tat 
Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

gtg cca aca gtg ttt gac aac tat gcg gtg acc gtg atg ate ggg gga 144 
Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Gly 
35 40 45 



96 
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gag 


ccc 


tac 


acc 


ctg 


ggc 


Glu 


Pro 


Tyr 


Thr 


Leu 


Gly 




50 










gac 


agg 


eta 


cgt 


cca 


etc 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


65 










70 


tgc 


ttc 


tec 


gtg 


gtc 


acc 


Cys 


Phe 


Ser 


Val 


Val 


Thr 










85 




tgg 


gtt 


ccg 


gag 


ate 


aca 


Trp 


Val 


Pro 


Glu 


lie 


Thr 








100 






gtg 


ggg 


aca 


cag 


att 


gat 


val 


Gly 


Thr 


Gin 


He 


Asp 






115 








ctg 


gec 


aag 


aac 


aaa 


cag 


Leu 


Ala 


Lys 


Asn 


Lys 


Gin 


■ •■ - 


130 










ctg 


gec 


cgc 


gag 


etc 


agg 


Leu 


Ala 


Arg 


Glu 


Leu 


Arg 


145 










150 


aac 


cag 


aga 


ggt 


ctg 


aag 


Asn 


Gin 


Arg 


Gly 


Leu 


Lys 










165 




eta 


gaa 


ccc 


cca 


gag 


cca 


Leu 


Glu 


Pro 


Pro 


Glu 


Pro 



180 



taa 
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ctg 


ttt 


gat 


acc 


gcg 


ggt 


Leu 


Phe 


Asp 


Thr 


Ala Gly 


55 










60 


age 


tac 


cct 


cag 


acg 


gac 


Ser 


Tyr 


Pro 


Gin 


Thr Asp 










75 




ccg 


tec 


tec 


ttc 


gag 


.aat 


Pro 


Ser 


Ser 


Phe 


Glu 


Asn 








90 






cat 


cac 


tgt 


cag' 


cgc 


acg 


His 


His 


Cys 


Gin 


Arg 


Thr 






105 








ctg 


cgc 


gac 


gac 


acg 


ccc 


Leu 


Arg Asp 


Asp 


Thr 


Pro 




120 










aag 


ccc 


ate 


acc 


gee 


gac 


Lys 


Pro 


lie 


Thr 


Ala Asp 


135 










140 


gcg 


gtc 


aag 


tac 


gtc 


gag 


Ala 


Val 


Lys 


Tyr 


Val 


Glu 










155 




aac 


gtg 


ttt 


gat 


gag 


get 


Asn 


val 


Phe 


Asp 


Glu 


Ala 








170 






ccc 


aag 


aaa 


aag 


aaa 


tgt 


Pro 


Lys 


Lys 


Lys 


Lys 


Cys 






185 









PCI7EP2004/008136 



cag gag gac tac 192 
Gin Glu Asp Tyr 

gtc ttc etc gtc 240 
Val Phe Leu Val 
80 

gtc agg gaa aag 288 
Val Arg Glu Lys 
95 

ccg ttc ctg ctg 336 
Pro Phe Leu Leu 
110 

acg gtg gag aag 384 

Thr Val Glu Lys 

125 

cac ggg gag cgt 432 
His Gly Glu Arg 

tgc tec gee etc 480 
Cys Ser Ala Leu 
160 

ate ctg gca gcg 528 
lie Leu Ala Ala 
175 

gtg etc ttg 573 
Val Leu Leu 
190 

576 



<210> 218 
<211> 191 
<212> PRT 

<213> Aplysia californica 
<400> 218 



Met 


Gin 


Thr 


He 


Lys 


Cys 


Val 


Val 


Val 


Gly 


Asp Gly Ala Val 


Gly 


Lys 


1 








5 










10 




15 




Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Thr 


Asn 


Lys Phe Pro Ser 


Glu 


Tyr 






20 










25 




30 






Val 


Pro 


Thr 
35 


Val 


Phe 


Asp 


Asn 


Tyr 
40 


Ala 


Val 


Thr Val Met He 
45 


Gly 


Gly 


Glu 


Pro 


Tyr 


Thr 


Leu 


Gly 


Leu 


Phe Asp 


Thr 


Ala Gly Gin Glu 


Asp 


Tyr 




50 










55 








60 






Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin 


Thr Asp Val Phe 


Leu 


Val 


65 










70 














80 


Cys 


Phe 


Ser 


Val 


Val 


Thr 


Pro 


Ser 


Ser 


Phe 


Glu Asn Val Arg 


Glu 


Lys 








85 










90 




95 




Trp 


Val 


Pro 


Glu 


He 


Thr 


His 


His 


Cys 


Gin 


Arg Thr Pro Phe 


Leu 


Leu 






100 










105 




110 






Val 


Gly 


Thr 


Gin 


He 


Asp 


Leu 


Arg Asp 


Asp 


Thr Pro Thr Val 


Glu 


Lys 






115 










120 






125 






Leu 


Ala 
130 


Lys 


Asn 


Lys 


Gin 


Lys 
135 


Pro 


He 


Thr 


Ala Asp His Gly 
140 


Glu 


Arg 


Leu 


Ala 


Arg 


Glu 


Leu 


Arg 


Ala 


Val 


Lys 


Tyr 


Val Glu Cys Ser 


Ala 


Leu 


145 










150 










155 




160 


Asn 


Gin 


Arg 


Gly 


Leu 
165 


Lys 


Asn 


Val 


Phe 


Asp 
170 


Glu Ala He Leu 


Ala 
175 


Ala 


Leu 


Glu 


Pro 


Pro 
180 


Glu 


Pro 


Pro 


Lys 


Lys 
185 


Lys 


Lys Cys Val Leu 
190 


Leu 





<210> 219 
<211> 579 
<212> DNA 

<213> Aplysia californica 
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<220> 

<221> CDS 

<222> (1) . . (579) 

<400> 219 

atg cag get ate aag tgt gtg gta gtc ggg gac ggt get gtg ggt aag 48 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

x 5 10 15 

aca tgt ctt ctt ate age tac act acc aat gee ttc ccc gga gaa tac 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

att cca act gtc ttt gac aac tac tct gca aat gtt atg gta gat ggc 144 
He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 AO 45 

aag cca ate aat ctt gga eta tgg gat act get gga caa gag gac tat 192 
Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 
50 55 60 

• gat* cgt etc aga ccc ttg tea tat ccg caa act gat gtc ttt ctt ata 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 ~ " 70 75 80 

tgt ttt tec ttg ate agt cct aca agt ttc gag aat gtt cga gca aag 288 
Cys Phe Ser Leu He Ser Pro Thr Ser Phe Glu Asn Val Arg Ala Lys 

85 90 95 

tgg ttt cct gaa gtg age cat cat tgc cct cat acc cct ate ate tta 336 
Trp Phe Pro Glu Val Ser His His Cys Pro His Thr Pro He He Leu 

100 105 HO 

gtg ggt acc aag ctt gat ctg cgt gaa gac aag gag aca att gag aaa 384 
Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys Glu Thr He Glu Lys 

115 120 125 

ctg aga gac aag aag ctg tec cct ate acc tac cct caa ggg ctg gec 432 
Leu Arg Asp Lys Lys Leu Ser Pro He Thr Tyr Pro Gin Gly Leu Ala 

• 130 135 140 

atg gca agg gag ata agt get gtg aag tat ctt gag tgc tea gee tta 480 
Met Ala Arg Glu He Ser Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

acc caa aaa ggg ctg aag aat gtc ttt gat gag get att egg get gtt 528 
Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

ttg tgc cct aag cct aag cca aag aag aag aaa gga tgc gaa att ttg 576 
Leu Cys Pro Lys Pro Lys Pro Lys Lys Lys Lys Gly Cys Glu He Leu 
180 185 190 

tga 



579 



<210> 220 
<211> 192 
<212> PRT 

<213> Aplysia calif ornica 



<400> 220 



Met 


Gin 


Ala 


He 


Lys 


Cys 


Val Val 


1 








5 






Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr Thr 






20 








He 


Pro 


Thr 


Val 


Phe 


Asp 


Asn Tyr 






35 








40 


Lys 


Pro 


He 


Asn 


Leu 


Gly 


Leu Trp 


50 










55 


Asp 


Arg 


Leu Arg 


Pro 


Leu 


Ser Tyr 


65 










70 




Cys 


Phe 


Ser 


Leu 


He 


Ser 


Pro Thr 








85 






Trp 


Phe 


Pro 


Glu 


Val 


Ser 


His His 






100 








Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu Arg 




115 








120 



Val Gly Asp Gly Ala Val Gly Lys 

10 15 
Thr Asn Ala Phe Pro Gly Glu Tyr 
25 30 
Ser Ala Asn Val Met Val Asp Gly 
45 

Asp Thr Ala Gly Gin Glu Asp Tyr 
60 

Pro Gin Thr Asp Val Phe Leu He 

75 80 
Ser Phe Glu Asn Val Arg Ala Lys 

90 95 
Cys Pro His Thr Pro He He Leu 
105 HO 
Glu Asp Lys Glu Thr He Glu Lys 
125 



u l-cl dua y ya. <- v-^w »^ 3 v* — j j j - —a- , — 

Leu Thr Gly Cys Phe Arg Cys Ser Tyr Pro Gly Ser Thr His Ala Ser 

165 ~ 170 175 

age teg gaa aaa aaa aca tac ctt atg tga 

Ser Ser Glu Lys Lys Thr Tyr Leu Met 
180 185 



<210> 222 
<211> 185 
<212> PRT 

<213> Schistosoma japonicum 

<400> 222 ' ' ^ 

Met Gin Ala He Lys Cys Val Val He Gly Asp Gly Ala Val Gly Lys 
! 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

Val Pro Thr val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Gly Glu 
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Leu Arg Asp Lys Lys Leu Ser Pro He Thr Tyr Pro Gin Gly Leu Ala 

130 135 140 

Met Ala Arg Glu He Ser Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala lie Arg Ala Val 

165 170 175 

Leu Cys Pro Lys Pro Lys Pro Lys Lys Lys Lys Gly Cys Glu He Leu 
180 185 190 

<210> 221 
<211> 558 . 
<212> DNA 

<213> Schistosoma japonicum 

<220> 
<221> CDS 

<222> (1) . . (558) . 
<400> 221 

atg caa gec ate aaa tgt gta gtt ate ggt gat ggt get gta gga aaa 48 

Met Gin Ala He Lys Cys Val Val He Gly Asp Gly Ala Val Gly Lys 

! 5 " 10 15 

aca tgt ttg etc att age tac act aca aat get ttt cct ggt gag tat 96 

Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

gtc cca act gtc ttc gat aac tac tct gee aat gta atg gtt ggt gaa 144 
Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Gly Glu 

35 40 45 

aaa cgt gtg aac ctt ggt ttg tgg gat aca get ggt caa gaa gat tat 192 
Lys Arg Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gat aga tta egg cca eta teg tat cca cag act gac gtt ttc ttg gtt 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

tgc ttt tct ttg ate agt cct tea tea ttt gat aac gtt cgt get aaa 
Cys Phe Ser Leu He Ser Pro Ser Ser Phe Asp Asn Val Arg Ala Lys 

85 90 95 

tgg tat cca gaa ata cgc cac ttt tct cca aac aca ccc att att ctt 336 
Trp Tyr Pro Glu He Arg His Phe Ser Pro Asn Thr Pro He He Leu 

100 ~ 105 HO 

gtt gga aca aaa ctg gat etc cgc aac agt tct acc agt cct aaa aac 384 
Val Gly Thr Lys Leu Asp Leu Arg Asn Ser Ser Thr Ser Pro Lys Asn 

115 120 125 

aac caa cca tea ata tct tat gaa caa ggt tta att atg gca aga gag 432 
Asn Gin Pro Ser He Ser Tyr Glu Gin Gly Leu He Met Ala Arg Glu 

130 135 140 

att gga get cat aag tat eta gaa tgt tea gcg tta act cag gat gga 48 0 

He Gly Ala His Lys Tyr Leu Glu Cys Ser Ala Leu Thr Gin Asp Gly 
145 150 155 160 

tta aca gga tgt ttt cga tgc age tat ccg ggc agt act cat gec tec 528 



240 



288 



558 
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35 40 45 

Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

55 " 60 
Leu Ser Tyr Pro Gin Thr Asp Val phe Leu Val 
70 75 80 

Ser Pro Ser Ser Phe Asp Asn Val Arg Ala Lys 

90 95 
Arg His Phe Ser Pro Asn Thr Pro lie lie Leu 

105 HO 
Asp Leu Arg Asn Ser Ser Thr Ser Pro Lys Asn 

120 125 
Ser Tyr Glu Gin Gly Leu He Met Ala Arg Glu 

135 * 140 

Tyr Leu Glu Cys Ser Ala Leu Thr Gin Asp Gly 
150 155 160 . 

Arg Cys Ser Tyr Pro Gly Ser Thr His Ala Ser 
170 175 

Thr - Tyr -Leu Met 

180 ~ - 185 

<210> 223 
<211> 576 
<212> DNA 

<213> Blumeria graminis 

<220> 
<221> CDS 
<222> (1) . . (576) 



Lys 


Arg 




Leu 




Dv 






Asp 


Arg 


Lieu a-l y 


Pro 










Cys 


pne 


OCX LIC LL 


lie 






85 


Trp 


Tyr 


Dm fill ii 


lie 








Val 


Gly 


inr L»ys 








115 




Asn 


Gin 


Pro Ser 


He 




130 






He 


Gly 


Ala His 


Lys 


145 








Leu 


Thr 


Gly Cys 


Phe 








165 


Ser 


Ser 


Glu Lys 


-Lys 



<400> 223 



48 



atg tct ctt aga tgc gtt gtt gtt ggc gat ggt gec gtt gga aaa act 
Met Ser Leu Arg Cys Val Val Val Gly Asp Gly Ala Val Gly Lys Thr 
15 10 15 

tgt ctt etc ate age tat aca aca aat aaa ttc ccc tec gaa tat gtc 96 
Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr Val 
20 25 3 0 



144 



192 



240 



cct act gtc ttc gat aat tat get gtt act gtt atg att ggc gat gag 
Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Asp Glu 

35 40 45 

cca tat ace ctt ggc eta ttc gac act get ggt caa gaa gat tac gat 
Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr Asp 

50 55 60 

cgc tta egg ccc ctt tea tac cct caa act gat gta ttt etc gtc tgt 
Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val Cys 
65 70 75 80 

ttc tct gtt act tea ccc gca tea ttt gaa aat gtt cgc gag aaa tgg 288 
Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Arg Glu Lys Trp 

85 90 95 

ttt cct gaa gtc cat cat cac tgc cct gga gtc cct tgc tta att gtt 33 6 

Phe Pro Glu Val His His His Cys Pro Gly Val Pro Cys Leu He Val 
100 105 1 HO 



384 



432 



ggc ace caa aca gat ctg aga gat gat ctt age gtt cga gaa aag tta 
Gly Thr Gin Thr Asp Leu Arg Asp Asp Leu Ser Val Arg Glu Lys Leu 

115 ~ 120 125 

aac aaa cag aaa atg caa cca gtt aag aga gaa gat gga gag aga atg 
Asn Lys Gin Lys Met Gin Pro Val Lys Arg Glu Asp Gly Glu Arg Met 

130 ** 135 140 

get aag gat ttg ggt gca gtt aga tat gtg gaa tgc age gcg ttg aca 480 
Ala Lys Asp Leu Gly Ala Val Arg Tyr Val Glu Cys Ser Ala Leu Thr 
145 150 155 160 

cag tat aag ttg aag gat gta ttc gat gag get ata gtg get gca eta 528 
Gin Tyr Lys Leu Lys Asp Val Phe Asp Glu Ala He Val Ala Ala Leu 

165 170 175 

gaa cct ccc get cca aaa aag aag cat cga aac tgt eta ate eta 573 
Glu Pro Pro Ala Pro Lys Lys Lys His Arg Asn Cys Leu He Leu 
180 185 190 



tga 



576 



WO 2005/014828 
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<210> 224 
<211> 191 
<212> PRT 

<213> Blumeria graminis 
<400> 224 

Met Ser Leu Arg Cys Val Val Val Gly Asp' Gly Ala Val Gly Lys Thr 

1 5 10 15 

Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr Val 

20 25 30 

Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Asp Glu 

35 40 45 

Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr Asp 

50 55 60 

Arg Leu Arg Pro Leu Ser -Tyr- Pro Gin Thr Asp Val Phe- Leu Val Cys 
65 70 75 80 

Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Arg Glu Lys Trp 

85 90 95 

Phe Pro Glu Val His His His Cys Pro Gly Val Pro Cys Leu He Val 

100 105 110 

Gly Thr Gin Thr Asp Leu Arg Asp Asp Leu Ser Val Arg Glu Lys Leu 

115 120 125 

Asn Lys Gin Lys Met Gin Pro Val Lys Arg Glu Asp Gly Glu Arg Met 

130 135 140 

Ala Lys Asp Leu Gly Ala Val Arg Tyr Val Glu Cys Ser Ala Leu Thr 
145 ~ " 150 ™ 155 160 

Gin Tyr Lys Leu Lys Asp Val Phe Asp Glu Ala He Val Ala Ala Leu 

165 170 175 

Glu Pro Pro Ala Pro Lys Lys Lys His Arg Asn Cys Leu He Leu 
180 * 185 190 

<210> 225 
<211> 757 
<212> DNA 

<213> Brassica napus 

<220> 

<221> CDS 

<222> (164).. (757) 

<400> 225 

cgatacaccg attcatttca ttcttcattg atcgcttcct ctcacatacc cgatttcgtt 60 
tagggttaac attttctagg gttttagaga taggcgaaac tgaaattgag aagaagacgg 120 

175 
223 
271 
319 
367 
415 



gtgttgtttc ctcatagatc tgaggggttg aatttttcga 


ttt 


atg 


age 


gca 


tct 


























Met 
1 


Ser 


Ala 


Ser 


egg 


ttc 


ata 


aag 


tgc 


gtg 


acg 


gtt 


ggg 


gac 


gga 


gca 


gtg 


ggc 


aaa 


aca 


Arg 


Phe 


He 


Lys 


Cys 


Val 


Thr 


Val 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 


Lys 


Thr 


5 










10 










15 










20 


tgt 


etc 


etc 


ate 


tct 


tac 


acc 


age 


aac 


act 


ttc 


cct 


acg 


gat 


tat 


gtt 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Ser 


Asn 


Thr 


Phe 


Pro 


Thr 


Asp 


Tyr 


Val 










25 










30 










35 




cca 


act 


gtt 


ttc 


gat 


aac 


ttc 


age 


get 


aat 


gtt 


gtt 


gtt 


aac 


gga 


gee 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


Val 


Val 


Asn 


Gly 


Ala 








40 








45 










50 






act 


gtc 


aac 


tta 


gga 


etc 


tgg 


gat 


acc 


gca 


ggg 


cag 


gag 


gat 


tat 


aac 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 






55 










60 










65 








aga 


ttg 


aga 


ccc 


ttg 


agt 


tac 


cgc 


ggt 


get 


gac 


gtt 


ttc 


ate 


ttg 


gee 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


He 


Leu 


Ala 




70 










75 










80 
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ttc 


tct 


etc 


ate 


agt 


aag 


Phe 


Ser 


Leu 


He 


Ser 


Lys 


85 










90 


ate 


cct 


gag 


ctg 


acc 


cac 


lie 


Pro 


Glu 


Leu 


Thr 


His 










105 




agt 


acc 


aaa 


eta 


gat 


ctt 


GlV 


Thr 


Lys 


Leu Asp 


Leu 








120 






cct 


ggt 


get 


gta 


cct 


att 


Pro 


Gly 


Ala 


Val 


Pro 


He 






135 








eta 


att 


gga 


get 


cct 


teg 


Leu 


He 


Gly 


Ala 


Pro 


Ser 




150 










aac 


gtg 


aaa 


ggg 


gtg 


ttt 


Asn 


Val 


Lv& 


Gly Val 


Phe 


165 










170 


cca 


aag 


cag 


aag 


aaa 


aag 


Pro 


Lys 


Gin 


Lys 


Lys 


Lys 










185 




ttg 


taa 










Leu 













172/291 

get agt tat gag aat gtc 
Ala Ser Tyr Glu Asn Val 
95 

tat gec cct ggt gtc cct 
Tyr Ala Pro Gly Val Pro 
110 

agg gat gac aaa cag ttc 
Arg Asp Asp Lys Gin Phe 
125 

acc act tct cag" gga gag 
Thr Thr Ser Gin Gly Glu 
140 

tac ate gag tgc agt tea 
Tyr He Glu Cys Ser Ser 
155 160 
gat gca gcg ate aga gtg 
Asp Ala Ala He Arg Val 

- - ■ 175 

aag age aag gca caa aag 
Lys Ser Lys Ala Gin Lys 
190 
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tec aag aag tgg 463 
Ser Lys Lys Trp 
100 

ate gtt ctt gtt 511 
He Val Leu Val 
115 

ttc gtt gac cac 559 
Phe Val Asp His 
130 

gaa eta atg aag 607 

Glu Leu Met Lys 

145 

aaa tct caa gag 655 
Lys Ser Gin Glu 

gta ctt caa cct 703 
Val Leu Gin Pro 

- i£0 -- - 
gee tgc tec att 751 
Ala Cys Ser He 
195 

757 



<210> 226 
<211> 197 
<212> PRT 

<213> Brassica napus 
<400> 226 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

15 10 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Thr His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe Val Asp His Pro Gly Ala Val Pro He Thr Thr Ser Gin Gly Glu 

130 135 140 

Glu Leu Met Lys Leu He Gly Ala Pro Ser Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Ser Gin Glu Asn Val Lys Gly Val Phe Asp Ala Ala He Arg Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Ser Lys Ala Gin Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210> 227 
<211> 1015 
<212> DNA 

<213> Zinnia elegans 



<220> 

<221> CDS 

<222> (170).. (763) 
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<400> 227 

ggtttgttct tggggtttgg ctggaaaaag attcaatttt tcggtgtgtt tttgggaaaa 60 

agatttggtg aaatttgtgg gtgttgaaga aagattgttg gggattattg ggtgtgagtg 120 

atttggtggt gatcaggtct tctgggtgtg gaagaagatc tgaagaaat atg agt gca 178 

Met Ser Ala 



tea 


aga 


ttt 


ata 


aag 


tgt 


gtt 


act 


gtt 


ggt 


gat 


ggg 


get 


1 

gtt 


gga 


aag 


226 


Ser 


Arg 


Phe 


He 


Lys 


Cys 


val 


Thr 


Val 


Gly 


Asp 


Gly 


Ala 


val 


Gly 


Lys 






5 










10 










15 












act 


tgt 


ttg 


ttg 


att 


tct 


tac 


ace 


agt 


aat 


acc 


ttc 


cct 


acg 


gat 


tac 


274 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Ser 


Asn 


Thr 


Phe 


Pro 


Thr 


Asp 


Tyr 




20 










25 










30 










35 




gtc 


cca 


acg 


gta 


ttt 


gat 


aat 


ttc 


agt 


get 


aat 


gta 


gtt 


gtc 


aat 


gga 


322 


Val 


Pro 


Thr- Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


Val 


Val 


Asn 


Gly 


- 










40 










45 










50 






age 


aca 


gtt 


aat 


eta 


ggc 


ttg 


tgg 


gac 


act 


get 


gga 


cag 


gag 


gac 


tat 


370 


Ser 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 










55 










60 










65 








aac 


aga 


tta 


aga 


ccg 


ttg 


agt 


tat 


cgt 


gga 


get 


gat 


gtt 


ttc 


att 


ttg 


418 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


He 


Leu 








70 










75 










80 










gcg 


ttc 


tea 


etc 


att 


age 


aag 


get 


age 


tat 


gag 


aac 


gtt 


tec 


aag 


aag 


466 


Ala 


Plie 


Ser 


Leu 


He 


Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


Val 


Ser 


Lys 


Lys 






85 










90 










95 












tgg 


att 


cca 


gaa 


ttg 


aag 


cat 


tat 


get 


cct 


ggt 


gtt 


cca 


ate 


gtt 


ctt 


514 


Trp 


lie 


Pro 


Glu 


Leu 


Lyj 


His 


Tyr 


Ala 


Pro 


Gly 


Val 


Pro 


He 


Val 


Leu 




100 










105 










110 










115 




gtt 


ggg 


acg 


aaa 


eta 


gat 


etc 


cga 


gat 


gat 


aag 


cag 


ttt 


ttt 


gta 


gat 


562 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 


Phe 


Val 


Asp 












120 










125 










130 






cac 


cct 


ggt 


gca 


act 


cct 


att 


ace 


act 


gca 


cag 


gga 


gag 


gaa 


tta 


aag 


610 


His 


Pro 


Gly 


Ala 


Thr 


Pro 


lie 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 


Lys 








13 5 










140 










145 








aag 


aca 


ate 


ggt 


get 


cct 


gag 


tac 


att 


gaa 


tgt 


agt 


tea 


aaa 


aca 


caa 


658 


Lys 


Thr 


lie 


Gly 


Ala 


Pro 


Glu 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 


Lys 


Thr 


Gin 






150 










155 










160 










ttg 


aat 


gtg 


aag 


caa 


gtt 


ttc 


gat 


get 


gee 


att 


aaa 


gtc 


gtt 


ctg 


gcg 


706 


Leu 


Asn 


Val 


Lys 


Gin 


Val 


Phe 


Asp 


Ala 


Ala 


He 


Lys 


val 


Val 


Leu 


Ala 






165 










170 










175 












cct 


cca 


aag 


gcg 


aag 


aaa 


aag 


aag 


gga 


aaa 


get 


caa 


aag 


gca 


tgc 


tea 


754 


Pro 


Pro 


Lys 


Ala 


Lys 


Lys 


Lys 


Lys 


Gly 


Lys 


Ala 


Gin 


Lys 


Ala 


Cys 


Ser 





180 185 190 195 

ata ttg tgatgaagaa gatgaaaacc agttgttcaa cgcttatctc gtcttgaatt 810 
He Leu 

caattccata tgttggttga tttgtatttt gatctttgag tactcgaaaa tcttggtgta 870 



ctaattctgt atcttggtca aaaatttgta tttccatget cgttttctgt ttgcgactgt 93 0 



tttactactt tgttcttcta agttgaaaaa teggtatcta tegttgaate aatgaattcc 990 



attctaatga aaaaaaaaaa aaaaa 1015 



<210> 228 
<211> 197 
<212> PRT 

<213> Zinnia elegans 
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<400> 228 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 io 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asn Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 8<T 

Phe He Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe Val Asp His Pro Gly Ala Thr Pro He Thr Thr Ala Gin Gly Glu ■ 

130 135 140 

Glu Leu Lys Lys Thr He Gly Ala Pro Glu Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lvs Thr Gin Leu Asn Val Lys Gin Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Ala Pro Pro Lys Ala Lys Lys Lys Lys Gly Lys Ala Gin Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210> 229 

<211> 843 ?\ 

<212> DNA 

<213> Zinnia elegans 

i 

<220> 
<221> CDS 

<222> (114) . . (710) 



<400> 229 

aaacacttgt cccattacta ctacacttat tccttttatc ttcctttccc ttgcttaaca 



60 



116 



164 



212 



acaacaacca aaccacttgt cgatctcttt gtatcatatt ccgatcgacc gaa atg 

Met 
1 

aca qca gca gcc ace aga ttc ate aag tgc gtc acc gtt ggc gac ggt 
Ala Ala Ala Ala Thr Arg Phe He Lys Cys Val Thr Val Gly Asp Gly 

5 10 15 

qcg gtc gga aaa acc tgc atg etc ate tec tac acc age aat act ttc 
Ala Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe 

20 25 - 30 

ccc acc gat tat gtg ccg aca gtg ttt gat aac ttc agt gcg aat gtt 260 
Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val 

35 40 45 

gtg gtc gga gac age act gtc aat ctt ggc ctt tgg gat act gcc ggt 3 08 

Val Val Gly Asp Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 
50 55 60 65 

cag gag gat tat aac aga ctg agg cca ctg agt tat aga ggt gca gat 356 
Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp 

70 75 80 

gtg ttt tta ttg gca ttt tct etc att age aga ccc agt tat gaa aac 
Val Phe Leu Leu Ala Phe Ser Leu He Ser Arg Pro Ser Tyr Glu Asn 

85 90 95 

ate tec aag aag tgg ate tea gaa eta agg cat tat gcc cca gat gtc 
He Ser Lys Lys Trp He Ser Glu Leu Arg His Tyr Ala Pro Asp Val 

100 ~ 105 HO 

ccc att gtg ctg gtg gga acc aaa tta gat tta cgt gaa gac aaa caa 
Pro He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys Gin 
115 120 125 



404 



452 



500 
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175/291 














tat 


ttg 


agt 


gat 


cat 


cca 


aat 


crt*t* aca ace 


ate 


aca 


act 


tct 


cag ggt 


548 


Tyr 


Leu 


Ser* 


Asp 


His 


Pro 


Asn 


Val Thr Ala 


He 


Thr 


Thr 


Ser 






li U 










135 






JL*X\J 








1 AC 




gaa 


gaa 


ctg 


aag 


aaa 


agt 


att 


yy u y yv>a 


crtcr 


tac 


att 


cracr 


ugt age 


c a rr 


/-in 
o-LU 




Leu 


Lys 


Lys 
150 


Ser 


lie 


Glv Ala Ala 

wijr ajlcl x*vj.cl 

• 155 


Val 

V GLJL 




He 


Glu 


uys oci , 

loU 




tec 


aaa 


act 


caa 


cacr 


aat 


crtcr 


aacf orct crfct 


ttt 


gat 


get 




att aga 


644 


Ser 


Lys 


Thr 


Gin 


Gin 


Asn 


Val 


Lys Ala Val 


Phe 


Asp 


Ala 


Ala 


He Arg 








J. O -J 








1 7fl 
x / u 








175 






gtt 


gtt 


tta 


caa 


cca 


cca 


aag 


ctg aag aaa 


aag 


cga 


age 


aag 


caa cga 


692 


Val 


Val 


Leu 
180 


Gin 


Pro 


Pro 


Lys 


Leu Lys Lys 
185 


Lys 


Arg 


Ser 
190 


Lys 


Gin Arg 




eta 


tgt 


gtc 


tat 


eta 


tagatctget tctctgtgca attgcatgta taatgttcta 


747 


Leu 


Cys 
195 


Val 


Tyr 


Leu 





















tgtataatgt ctttctcatt tagaacttta aatgatgtaa tgaaatccaa gtcatcaaat 807 



ggtttccttg taaaaaaaaa aaaaaaaaaa aaaaaa 843 



<210> 230 
<211> 198 
<212> PRT 

<213> Zinnia elegans 
<400> 230 



Met 


Ala Ala -Ala 


Ala 


Thr 


Arg 


Phe He 


Lys 


Cys 


Val 


Thr 


Val 


Gly 


Asp 


1 




5 








10 










15 




•Gly 


Ala Val Gly 


Lys 


Thr 


Cys 


Met Leu 


He 


Ser 


Tyr 


Thr 


Ser 


Asn 


Thr 




20 








25 










30 






Phe 


Pro Thr Asp 


Tyr 


Val 


Pro 


Thr Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 




35 








40 








45 








Val 


Val Val Gly 


Asp 


Ser 


Thr 


Val Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 




50 






55 








60 










Gly 


Gin Glu Asp 


Tyr 


Asn 


Arg 


Leu Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly 


Ala 


65 






70 








75 










80 


Asp 


Val Phe Leu 


Leu 


Ala 


Phe 


Ser Leu 


He 


Ser 


Arg 


Pro 


Ser 


Tyr 


Glu 






85 








90 










95 




Asn 


He Ser Lys 


Lys 


Trp 


He 


Ser Glu 


Leu 


Arg 


His 


Tyr 


Ala 


Pro 


Asp 




100 






105 










no 






Val 


Pro He Val 


Leu 


Val 


Gly 


Thr Lys 


Leu 


Asp 


Leu 


Arg 


Glu 


Asp 


Lys 




115 








120 








125 








Gin 


Tyr Leu Ser 


Asp 


His 


Pro 


Asn Val 


Thr 


Ala 


He 


Thr 


Thr 


Ser 


Gin 




130 






135 








140 










Gly 


Glu Glu Leu 


Lys 


Lys 


Ser 


He Gly 


Ala 


Ala 


Val 


Tyr 


He 


Glu 


Cys 


145 






150 








155 










160 


Ser 


Ser Lys Thr 


Gin 


Gin 


Asn 


Val Lys 


Ala 


Val 


Phe 


Asp 


Ala 


Ala 


He 




165 








170 










175 




Arg 


Val Val Leu 


Gin 


Pro 


Pro 


Lys Leu 


Lys 


Lys 


Lys 


Arg 


Ser 


Lys 


Gin 




180 








185 










190 






Arg 


Leu Cys Val 


Tyr 


Leu 





















195 

<210> 231 
<211> 659 
<212> DNA 

<213> Ciona intestinal is 

<220> 

<221> CDS 

<222> (24) . . (605) 



<400> 231 

ataaaattta ttgaaacacg aag atg net tea att cgt aag aag ctg aaa ata 

Met Xaa Ser He Arg Lys Lys Leu Lys He 



53 
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15 10 
gat ggc gaa ggg get tgt ggt aaa act tgc tta etc ate gng ttt age 101 
Asp Gly Glu Gly Ala Cys Gly Lys Thr Cys Leu Leu lie Xaa Phe Ser 

15 20 25 

aaa gat caa ttc cca gaa gtt tac gtc cca act gtt ttt gaa aac tat 149 
Lys Asp Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr 

30 35 . 40 

gtg get gat att gaa gta gac tct aaa cag gtt gag ctt get ttg tgg 197 
Val Ala Asp lie Glu Val Asp Ser Lys Gin Val Glu Leu Ala Leu Trp 

45 50 55 

gat aca get ggt caa gaa gat tac gac agg ctt cgt cca ctt tec tac 245 
Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr 

60 65 70 

ccc gat act gat gtt att ctt atg tgt ttc tea ate gac agt cca gat 293 
Pro Asp Thr Asp Val lie Leu Met Cys Phe Ser lie Asp Ser Pro Asp 
75 80 85 90 

tea ctt gag aac att ccc gaa aaa tgg ace cct gag gta agg cat ttt 341 
Ser Leu: Glu Asn lie Pro Glu Lys Trp Thr Pro Glu Val Arg His Phe 

95 100 105 

tgc cca agt gtt cca aty att ttg gtt gga aac aaa aaa gat ctt cgt 389 
Cys Pro Ser Val Pro lie lie Leu Val Gly Asn Lys Lys Asp Leu Arg 

110 115 120 

aac ggc agt tea aca ata aaa gaa ctt get aaa atg aaa caa tct gec 437 
Asn Gly Ser Ser Thr lie Lys Glu Leu Ala Lys Met Lys* Gin Ser Ala 

125 130 135 

gtt agt aac gaa gat gga atg gcg atg gca gac aag att ggc get tac 485 
Val Ser Asn Glu Asp Gly Met Ala Met Ala Asp Lys He Gly Ala Tyr 

i40 145 150 

gga tat atg gaa tgt tea gcg cgc ace aaa gaa ggt gtt egg gaa gtg 533 
Gly Tyr Met Glu Cys Ser Ala Arg Thr Lys Glu Gly Val Arg Glu Val 
155 160 165 170 

ttt gag ctt gca act aaa gca get tta caa ace aag aaa aga aag aaa 581 
Phe Glu Leu Ala Thr Lys Ala Ala Leu Gin Thr Lys Lys Arg Lys Lys 

175 180 185 

aag agt gga tct gaa gtc ttg taaagaaatt ttaactggag tctagcaatt 632 
Lys Ser Gly Ser Glu Val Leu 
190 

gacttaacat tggacgcatg ctttcat 659 



<210> 232 
<211> 193 
<212> PRT 

<213> Ciona intestinalis 
<400> 232 

Met Xaa Ser He Arg Lys Lys Leu Lys He Asp Gly Glu Gly Ala Cys 

X 5 10 15 

Gly Lys Thr Cys Leu Leu He Xaa Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Ser Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 * 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 ~ ~ 70 75 80 

Leu Met Cys Phe Ser He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Arg His Phe Cys Pro Ser Val Pro He 

100 105 HO 

He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Gly Ser Ser Thr He 

115 120 125 

Lys Glu Leu Ala Lys Met Lys Gin Ser Ala Val Ser Asn Glu Asp Gly 

130 135 140 

Met Ala Met Ala Asp Lys He Gly Ala Tyr Gly Tyr Met Glu Cys Ser 
145 150 155 160 



96 
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Ala Arg Thr Lys Glu Gly Val Arg Glu Val PtLe Glu Leu Ala Thr Lys 

165 "* 170 175 

Ala Ala Leu Gin Thr Lys Lys Arg Lys Lys Lys Ser Gly Ser Glu Val 
180 185 190 

Leu 

<210> 233 
<211> 567 
<212> DNA 

<213> Schistosoma mansoni 

<220> 

<221> CDS 

<222> (1) ... (567) 

<400> 233 

atg caa gcc ate aaa tgt gtg gtt' gtc ggt gat ggc get gtc gga aag - 48 

Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
1 5 10 15 ■ 

acc tgt tta etc att age tac acg acg aat gcc ttt cct ggt gag tac 
Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

gtc cca acc gtt ttc gat aac tat tct gcc aac gta atg gtt gac cga 144 
Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Arg 

35 40 45 

aaa cct gtg aac ctt ggt ttg tgg gat aca get ggt cag gag gat tat 192 
Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gac aga eta aga cca etc tea tat cca caa act gat gtg ttt ttg gtg 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 .70 " 75*. 80 

tgc ttc tec tta gtc agt cgc aca teg ttc gag aat gtt cgt agt aaa 288 
Cys Phe Ser Leu Val Ser Arg Thr Ser Phe Glu Asn Val Arg Ser Lys 

85 90 95 

tgg cat ccg gag ata tec gca tat gtt cca aga get cct att att ctt 336 
Trp His Pro Glu lie Ser Ala^Tyr Val Pro Arg Ala Pro lie lie Leu 

100 105 110 

gtc gga act aaa cga gac ctt egg gat agt cct aat ggc eta aaa tea 384 
Val Gly Thr Lys Arg Asp Leu Arg Asp Ser Pro Asn Gly Leu Lys Ser 

115 120 125 

aca acg ttt cca gtc acg tat gca gag ggt tgc agg atg gcc aga gaa 432 
Thr Thr Phe Pro Val Thr Tyr Ala Glu Gly Cys Arg Met Ala Arg Glu 

130 135 140 

•att aaa get gtg aaa tac ctg gaa tgc tct get eta act cag ttt gga 480 
He Lys Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu Thr Gin Phe Gly 
145 150 155 160 

tta aaa gat gtc ttc gac gaa get att cga gca gta ctg atg cct gaa 528 
Leu Lys Asp Val Phe Asp Glu Ala lie Arg Ala Val Leu Met Pro Glu 

165 170 175 

ggg aag aaa aag aaa cat agt tea tgt gaa tta att taa 567 
Gly Lys Lys Lys Lys His Ser Ser Cys Glu Leu lie . 

180 185 

<210> 234 
<211> 188 
<212> PRT 

<2 13 > Schistosoma mansoni 
<400> 234 

Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Arg 

35 40 45 

Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 
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50 55 60 

_ _. . * — t Ser «pyr Pro Gin Thr Asp Val Phe Leu Val 

75 80 
Arg Thr Ser Phe Glu Asn Val Arg Ser Lys 

90 95 
Ala Tyr Val- Pro Arg Ala Pro lie lie Leu 

105 HO 
Leu Arg Asp Ser Pro Asn Gly Leu Lys Ser 

120 ~ 125 
Tyr Ala Glu Gly' Cys Arg Met Ala Arg Glu 
135 140 
Leu Glu Cys Ser Ala Leu Thr Gin Phe Gly 
155 160 
Glu Ala He Arg Ala Val Leu Met Pro Glu 

170 175 
Ser Ser Cys Glu Leu He 
180 " 185 

<210> 235 
<211> 600 
<212> DNA 

<213> Penicillium marneffei 

<220> 
<221> CDS 
<222> (1) . . (600) 



Asp Arg 


Leu 


Arg 


Pro 


Leu 


65 








70 


Cys Phe 


Ser 


Leu 


Val 


Ser 






85 




Trp His 


Pro 


Glu 


He 


Ser 




100 






Val Gly 


Thr 


Lys 


Arg 


Asp 




115 








Thr Thr 


Phe 


Pro 


Val 


Thr 


130 










He Lys 


Ala 


Val 


Lys 


Tyr 


145 








150 


Leu Lys 


Asp 


Val 


Phe 


Asp 






165 




Gly Lys 


Lys 


Lys 


Lys 


His 



48 
96 
144 



ata°qcq 3 tct ggg cct gcg act caa teg ttg aag tgt gtg gtg acc ggt 
Me? Ill Ser H? Pro III Thr Gin Ser Leu Lys Cys Val Val 'Thr Gly 
1 5 10 1= 

gat ggt get gtc ggc aag aca tgt etc etg ata tea tac acc acc aat 
Asp lly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn 

20 " 25 30 

gec ttt ccc ggc gaa tac att ccc acc gta ttc gac aae tac tee get 
Ala Phe Pro Gly Glu Tyr He Pro Thr Val - Phe Asp Asn Tyr Ser Ala 

35 40 45 

agt gtc atg gtc gac ggc agg ccc ate age etc gga etc tgg gat aca 192 
llr Val Mel Val Lp Gly Arg Pro He Ser Leu- Gly Leu Trp Asp Thr 

50 55 60 

get ggt caa gag gat tat gac cgt etc cgc ccc tta tec tac cct caa 
Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin 
65 70 75 80 

acc gac gtc ttc ttg ate tgc ttc tct ate gtc tct cca cca tec ttt 
Thr Asp Val Phe Leu He Cys Phe Ser He Val Ser Pro Pro Ser Phe 

85 9° 95 

gac aac gta aaa gee aag tgg ttt ccc gaa ate gag cae cat gca ccc 
Asp Asn Val Lys Ala Lys Trp Phe Pro Glu He Glu Hxs His Ala Pro 

100 105 110 

gge gtg ccc ate att ctt gtc ggc aca aag ctt gat ttg aga gaa gat 
Gly Val Pro He He Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp 

115 120 125 

aga get acc get gag gcg ctg cga gee aaa aag atg gag ccg gtg teg 
Arg Ala Thr Ala Glu Ala Leu Arg Ala Lys Lys Met Glu Pro Val Ser 

130 135 140 

tat gag cag gcg ctt gca gtt gca aag gag att agg gcg cat aag tat 
iyr III Gil III Leu Ala Val Ala Lys Glu He Arg Ala His Lys Tyr 
145 150 I 55 

ctg gag tgt teg gec ttg acc cag agg aat ttg aag age gtg ttt gat 528 
Leu Glu C?s Ser Ila Leu Thr Gin Arg Asn Leu Lys Ser Val Phe Asp 

165 170 1"5 

qaa get att egg gee gtc etc aat cct cgc cct caa ccc aag aac aag 
5lu Ala He Arg Ala Val Leu Asn Pro Arg Pro Gin Pro Lys Asn Lys 

180 185 190 

gca aaa cga tgc act att ctg taa 
Ala Lys Arg Cys Thr He Leu 
195 



240 
288 
336 
384 
432 
480 



576 
600 
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<210> 236 
<211> 199 
<212> PRT 

<213> Penicillium marneffei 
<400> 236 



Met 


Ala 


Ser 


Gly 


Pro 


Ala Thr 


Gin 


Ser 


Leu Lys Cys Val Val 


Thr 


Gly 


1 






5 








10 


15 




Asp 


Gly 


Ala 


Val 


Gly 


Lys. Thr 


Cys 


Leii 


Leu He Ser Tyr Thr 


Thr 


Asn 




20 








25 


30 






Ala 


Phe 


Pro 


Gly 


Glu 


Tyr lie 


Pro 


Thr 


Val Phe Asp Asn Tyr 


Ser 


Ala 






35 






40 




45 






Ser 


Val 
50 


Met 


Val 


Asp 


Gly Arg 
55 


Pro 


lie 


Ser Leu Gly Leu Trp 
60 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr Asp 


Arg 


Leu 


Arg Pro Leu Ser Tyr 


Pro 


Gin 


65 








70 






75 




80 


Thr 


Asp 


Val 


Phe 


Leu 


lie* Cys 


Phe 


Ser 


Xle" Val Ser Pro Pro 


Ser 


Phe 








85 








90 


95 




Asp 


Asn 


Val 


Lys 


Ala 


Lys Trp 


Phe 


Pro 


Glu He Glu His His 


Ala 


Pro 






100 








105 


110 






Gly 


Val 


Pro 


He 


He 


Leu Val 


Gly 


Thr 


Lys Leu Asp Leu Arg 


Glu 


Asp 




115 








120 




125. 






Arg 


Ala 


Thr 


Ala 


Glu 


Ala Leu 


Arg 


Ala 


Lys Lys Met Glu Pro 


Val 


Ser 


130 








135 






140 






Tyr 


Glu 


Gin 


Ala 


Leu 


Ala Val 


Ala 


Lys 


Glu lie Arg Ala His 


Lys 


Tyr 


145 










150 






155 




160 


Leu 


Glu 


Cys 


Ser 


Ala 


Leu Thr 


Gin 


Arg 


Asn Leu Lys Ser Val 


Phe 


Asp 








165 








170 


175 




Glu 


Ala 


lie 


Arg 
180 


Ala 


Val Leu 


Asn 


Pro 
185 


Arg Pro Gin Pro Lys 
190 


Asn 


Xiys 


Ala 


Lys 


Arg 
195 


Cys 


Thr 


lie Leu 













<210> 237 
<211> 845 
<212> DNA 

«c213> Schistosoma japonicum 



<220> 

<221> CDS 

<222> (39) . . (617) 

<400> 237 

ccgggccccc ccttggactt tactggtttg ctggagtg atg agt gca ataagg aaa 

Met Ser Ala He Arg Lys 



aag 


etc 


gtt 


att 


gtt 


ggg 


gat 


gga 


gec 


tgt 


1 

ggg 


aaa 


act 


tgt 


5 

eta 


eta 


104 


Lys 


Leu 


Val 


He 


Val 


Gly 


Asp 


Gly 


Ala 


Cys 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 








10 










15 










20 








ata 


gta 


ttt 


age 


aaa 


gac 


cag 


ttt 


cct 


gaa 


gtt 


tac 


gtt 


ccg 


act 


gta 


. 152 


He 


Val 


Phe 


Ser 


Lys 


Asp 


Gin 


Phe 


Pro 


Glu 


Val 


Tyr 


Val 


Pro 


Thr 


Val 








25 








30 










35 










ttc 


gaa 


aac 


tat 


gtc 


gcg 


gat 


att 


gag 


gta 


gat 


aac 


aaa 


cag 


gtt 


gaa 


200 


Phe 


Glu 


Asn 


Tyr 


Val 


Ala 


Asp 


He 


Glu 


Val 


Asp 


Asn 


Lys 


Gin 


Val 


Glu 






40 








45 










50 












ttg 


gca 


ctt 


tgg 


gat 


act 


get 


ggt 


caa 


gaa 


gac 


tat 


gac 


aga 


tta 


aga 


248 


Leu 


Ala 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 




55 






60 










65 










70 




cca 


ctt 


tct 


tac 


ccc 


gat 


acg 


gat 


gtt 


att 


tta 


atg 


tgc 


ttt 


age 


ate 


296 


Pro 


Leu 


Ser 


Tyr 


Pro 


Asp 


Thr 


Asp 


Val 


He 


Leu 


Met 


Cys 


Phe 


Ser 


He 










75 










80 










85 






gac 


act 


cca 


gat 


agt 


ttg 


gaa 


aac 


ata 


cct 


gag 


aaa 


tgg 


aca 


cca 


gaa 


344 


Asp 


Thr 


Pro 


Asp 


Ser 


Leu 


Glu 


Asn 


He 


Pro 


Glu 


Lys 


Trp 


Thr 


Pro 


Glu 








90 










95 










100 








gtc 


cga 


cat 


ttc 


tgc 


cct 


gat 


gta 


cct 


att 


gtc 


tta 


gtt 


gga 


aac 


aaa 


392 


Val 


Arg 


His 


Phe 


Cys 


Pro 


Asp 


Val 


Pro 


He 


Val 


Leu 


Val 


Gly 


Asn 


Lys 






105 








110 










115 
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- crac aaa cat aqa crga 440 



aag 


gac 


ttg cga 


Lys 


Asp 


Leu Arg 




120 




aaa 


cag 


_-j -> +-4- +- 

gaa tt-t 


Lys 


Gin 


G1U irlie 


135 






att 


cat 


get tac 


He 


His 


Aia xyr 


gtt 


cgt 


gaa gtt 


Val 


Arg 


Glu Val 






170 


aaa 


acg 


aaa aag 


Lys 


Thr 


Lys Lys 






185 









180/291 


acrt 


gaa 


aac 


tea aca aga 


Ser 


Glu Asn 


Ser Thr Arg 






125 




gtt 


aaa 


teg 


gat gag ggt 


Val 


Lys 


Ser 


Asp Glu Gly 




140 






tea 


tat 




cracr tcrt tea 


Ser Tyr 


He 


Glu Cys Ser 


155 






160 


ttc 


gag 


acc 


crccr act aaa 


Phe 


Glu 


Thr 


Ala Thr Arg 








175 


aag 


aag 


tgt 


gat eta att 


Lys Lys 


Cys 


Asp Leu He 








190 



130 

tat gca atg gec gat cgt 
Tyr Ala Met Ala Asp Arg 
145 150 
get aag acg aag gaa gga 
Ala Lys Thr Lys Glu Gly 
165 



180 



488 
536 
584 
634 
694 

atatggtuciu uuyyuwi.^ w. 3 **w~w~- — - - 

attaaatttc gactttttat taactgcttg tttcttcaac aaacatattt tggtttttta 754 
aaaagacaat ccattccatg ataaccttta ttatccgttt ttatcttgat actaaaatct 814 
gtcaggcctg ttaaaaaaaa aaaaaaaaaa a 845 



<210> 238 
<211> 192 
<212> PRT 

<213> Schistosoma japonicum 

Met°Ser 3 Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 1° 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Asn Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 

Leu Met Cys Phe Ser He Asp Thr Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

Glu Lys Trp Thr Pro Glu Val Arg His Phe Cys Pro Asp Val Pro He 

100 105 HO 

Val Leu Val Gly Asn Lys Lys Asp Leu Arg Ser Glu Asn Ser Thr Arg 

115 * 120 125 

Asp Asp Gly His Arg Gly Lys Gin Glu Phe Val Lys Ser Asp Glu Gly 

130 135 140 

Tyr Ala Met Ala Asp Arg He His Ala Tyr Ser Tyr He Glu Cys Ser 
lis 150 155 160 

Ala Lys Thr Lys Glu Gly Val Arg Glu Val Phe Glu Thr Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Ser Lys Lys Thr Lys Lys Lys Lys Cys Asp Leu He 
180 185 190 

<210> 239 
<211> 582 
<212> DNA 

<213> Ustilago maydis 



<220> 

<221> CDS 

<222> (1)..(582) 



WO 2005/014828 



<400> 239 




atg 


cag 


acc 


ate aag tgt 


Met 


Gin 


Thr 


lie Lys Cys 


1 






5 


acg 


tgt 


ctt 


ctt ate teg 


Thr 


Cys 


Leu 


Leu lie Ser 








20 


ate 


ccc 


aca 


gtg ttt gac 


lie 


Pro 


Thr 


Val Phe Asp 






35 




aaa 


ccc 


gtc 


tct etc ggt 


Lys 


Pro 


Val 


Ser Leu Gly 




50 






gac 


egg 


tta 


aga ccg etc 


Asp 


Arg 


Leu 


Arg Pro Leu 


65 






70 


tgc 


tt-t- 


tea- 


-ctg gtg-agt 


Cys 


Phe 


ser 


Leu Val Ser 








85 


tgg 


tgg 


ccc 


gaa gtg teg 


Trp 


Trp 


Pro 


Glu Val Ser 








100 


gtg 


gga 


acc 


aag ttg gat 


Val 


Gly 


Thr 


Lys Leu Asp 






115 




ctg 


cgc 


gat 


cgc agg atg 


Leu 


Arg 


Asp 


Arg Arg Met 




13 0 






atg 


gcg 


agg 


gac att cac 


Met 


Ala 


Arg 


Asp lie His 


145 






150 


acc 


cag 


aag 


gga ttg aag 


Thr 


Gin 


Lys 


Gly Leu Lys 








165 


ttg 


get 


cct 


gca cca gtc 


Leu 


Ala 


Pro 


Ala Pro Val 








180 


ctt 


taa 






Leu 









<210> 240 
<211> 193 
<212> PRT 

<213> Ustilago maydis 



<400> 240 



Met 


Gin 


Thr 


lie 


Lys 


Cys 


1 








5 




Thr 


Cys 


Leu 


Leu 
20 


He 


Ser 


He 


Pro 


Thr 
35 


Val 


Phe 


Asp 


Lys 


Pro 
50 


Val 


Ser 


Leu 


Gly 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


65 










70 


Cys 


Phe 


Ser 


Leu 


Val 


Ser 








85 




Trp 


Trp 


Pro 


Glu 
100 


Val 


Ser 


Val 


Gly 


Thr 
115 


Lys 


Leu 


Asp 


Leu 


Arg 
130 


Asp 


Arg 


Arg 


Met 


Met 


Ala 


Arg 


Asp 


lie 


His 
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gta 


gtc gtc 


gga 


gac 


ggt 


Val 


Val Val 


Gly 


Asp 


Gly 






10 






tat 


aca acc 


aac 


gee 


ttc 


Tyr 


Thr Thr 


Asn 


Ala 


Phe 




25 








aac 


tat teg 


gca 


aac 


gtc 


Asn 


Tyr Ser 


Ala* 


Asn 


Val 




40 








etc 


tgg gat 


act 


gcg 


ggt 


Leu 


Trp Asp 


Thr 


Ala 


Gly 


55 








60 


teg 


tac ccg 


caa 


acc 


gac 


Ser 


Tyr Pro 


Gin 


Thr 


Asp 








75 




cca- 


-cct teg 


ttc 


gaa 


aac 


Pro 


Pro Ser 


Phe 


Glu 


Asn 






90 






cat 


cac get 


ccc 


aac 


att 


His 


His Ala 


Pro 


Asn 


He 




105 








ctg 


cgc gag 


gat 


cca 


gaa 


Leu 


Arg Glu 


Asp 


Pro 


Glu 




120 








cag 


cct ate 


acg 


tat 


gcg 


Gin 


Pro He 


Thr 


Tyr 


Ala 


135 








140 


get 


acc aag 


tat 


ttg 


gag 


Ala 


Thr Lys 


Tyr 


Leu 


Glu 








155 




ggc 


gta ttt 


gat 


gaa 


gcg 


Gly 


Val Phe 


Asp 


Glu 


Ala 






170 






cag 


agt aag 


aag 


aaa 


aac 


Gin 


Ser Lys 


Lvs 


Lys 


Asn 




185 








Val 


Val Val 


Gly 


Asp 


Gly 






10 






Tyr 


Thr Thr 


Asn 


Ala 


Phe 


25 








Asn 


Tyr Ser 


Ala 


Asn 


Val 




40 








Leu 


Trp Asp 


Thr 


Ala 


Gly 


55 








60 


Ser 


Tyr Pro 


Gin 


Thr 


Asp 








75 




Pro 


Pro Ser 


Phe 


Glu 


Asn 






90 






His 


TT A i-i Til - 

HIS Ala 


Pro 


Asn 


He 




105 








Leu 


Arg Glu 


Asp 


Pro 


ulu 




120 








Gin 


Pro He 


Thr 


Tyr 


Ala 


135 








140 


Ala 


Thr Lys 


Tyr 


Leu 


Glu 
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gee 


gtc 


gga 


aag 


48 


Ala 


Val 


Gly 


Lys 








15 






ccg 


gg c 


gaa 


tac 


96 


Pro 


Gly 


Glu 


Tyr 






30 








atg 


gtc 


gac 


ggt 


144 


Met 


Val 


Asp 


Gly 




45 










caa 


gaa 


gac 


tac 


192 


Gin 


Glu 


Asp 


Tyr 




gtg 


ttc 


etc 


gtg 


240 


Val 


Phe 


Leu 


Val 










80 




gtt 


cga 


acc 


aag 


288 


Val 


Arg 


Thr 


Lys 








95 






cec 


acc 


ate 


etc 


336 


Pro 


Thr 


He 


Leu 






110 








acg 


att 


gee 


aag 


384 


Thr 


He 


Ala 


Lys 




125 










cag 


ggc 


aat 


cag 


432 


Gin 


Gly 


Asn 


Gin 




tgc 


tct 


gca 


etc 


480 


Cys 


Ser 


Ala 


Leu 










160 




ate 


agg 


age 


gtt 


528 


He 


Arg 


Ser 


Val 








175 






aac 


tgt 


ttg 


att 


576 


Asn 


Cys 


Leu 


He 






190 
















582 


Ala 


val 


Gly 


Lys 








15 






Pro 


Gly 


Glu 


Tyr 






30 








Met 


Val 


Asp 


Gly 




45 










Gin 


Glu 


Asp 


Tyr 




Val 


Phe 


Leu 


Val 










80 




Val 


Arg 


Thr 


Lys 














Pro 


Thr 


He 


Leu 






110 








Thr 


He 


Ala 


Lys 




125 










Gin 


Gly 


Asn 


Gin 




Cys 


Ser 


Ala 


Leu 





300 
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145 ' 150 155 160 

Thr Gin Lys Gly Leu Lys Gly Val Phe Asp Glu Ala He Arg Ser Val 

165 ~ 170 175 

Leu Ala Pro Ala Pro Val Gin Ser Lys Lys Lys Asn Asn Cys Leu He 
180 185 190 

Leu 

<210> 241 
<211> 1196 
<212> DNA 

<213> Ustilago maydis 

<220> 
<221> CDS 

<222> (436) (1080) 
<400> 241 

aattgcgatg ggatgcaacc gtccacaggg tcccctaagc ccgtgggttc atgtggtcaa 60 

cctcgccacg gccttgatca acacccacca ccgcattcgt agattttcaa acaataccgg 12 0 

ccccatttca tcgccttttg gttttggaag tcaggaacaa gcagcatcat agcaagtgga 180 

cgcaccaacg agcccaagtt acagccaatg gacattcagc tttcctccgg ttcgctcgac 240 

ggccgaatcc tcctcctctc tgtcgggaac gggcgcacaa ccattgtagg aattgacacc 

gattgtttca attgtcccag ccagaagttt gcaccgctca gcacctttgg cattggtagg 3 60 

aacagaaatc ccttttcgtt tccatcgcaa acaggaacac tgtcttgttg atcatctttg 420 

ctcccgccgc tcagc atg gca ccg gca gca att tgt agg aag ctt gta att 471 
Met Ala Pro Ala Ala He Cys Arg Lys Leu Val He 
1 5 10 

gtc ggc gac ggt get tgc ggc aag acg agt ttg ctt tgc gtt ttt gcc 519 
Val Gly Asp Gly Ala Cys Gly Lys Thr Ser Leu Leu Cys Val Phe Ala 

15 ' 20 25 

att ggc gag ttc ccg caa gag tat gaa ccc acc att ttc gaa aac tac 567 
He Gly Glu Phe Pro Gin Glu Tyr Glu Pro Thr He Phe Glu Asn Tyr 

30 35 40 

gtc gcc gag ate cgc ctt gat ggc aag cct gtc cag ctg gcg eta tgg 615 
Val Ala Glu He Arg Leu Asp Gly Lys Pro Val Gin Leu Ala Leu Trp 
45 50 55 60 

gac acc gcg ggt cag gaa gaa tac gag cgt ctt cgt cca etc tec tac 663 
Asp Thr Ala Gly Gin Glu Glu Tyr Glu Arg Leu Arg Pro Leu Ser Tyr 

65 70 75 

tea caa gca cac gtc ate ttg ate gcc ttt gcc ate gat aca ccc gac 711 
Ser Gin Ala His Val He Leu He Ala Phe Ala He Asp Thr Pro Asp 

80 85 90 

teg etc gaa aac gtg caa gtc aag tgg atg gag gag gta cgt caa ata 759 
Ser Leu Glu Asn Val Gin Val Lys Trp Met Glu Glu Val Arg Gin He 

95 100 105 

tgc ggc ccc tea gtg cct gtg etc ctg gta ggc tgc aag aag gat ctt 
Cys Gly Pro Ser Val Pro Val Leu Leu Val Gly Cys Lys Lys Asp Leu 

110 H5 120 

cgc gaa aat gcc ate get aag ggc aag ccg gtt cag ggt cac tat gta 
Arg Glu Asn Ala He Ala Lys Gly Lys Pro Val Gin Gly His Tyr Val 
125 130 135 140 

aag aga caa cag get aaa ctg gta gca gca cag ate ggc get cga teg 
Lys Arg Gin Gin Ala Lys Leu Val Ala Ala Gin He Gly Ala Arg Ser 



807 



855 



903 
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145 










150 




155 




tat 


cac 


gaa 


tgc 


tea 


tea 


etc 


aac 


aac 


caa 


ggc gtc gac 


gee gtg ttc 


951 


Tyr 


His 


Glu 


Cys 


Ser 


Ser 


lieu 


Asn 


Asn 


Gin 


Gly Val Asp 


Ala Val Phe 










160 










165 






170 




gaa 


gee 


gcg 


acg 


cgc 


gcg 


gee 


atg 


Ctt 


gtg 


cgc aat tec 


ggt get tct 


999 


Glu 


Ala 


Ala 


Thr 


Arg 


Ala 


Ala 


Met 


Leu 


Val 


Arg Asn Ser 


Gly Ala Ser 








175 










180 






185 




tea 


gga 


ggc 


gec 


ate 


teg 


cag 


age 


aag 


acc 


aag gag gca 


tta cac aac 


1047 


Ser 


Gly 


Gly 


Ala 


He 


Ser 


Gin 


Ser Lys 


Thr, 


Lys Glu Ala 


Leu His Asn 






190 










195 








200 






gat 


get 


ggc 


tct 


tgc 


aaa 


tgt 


ate 


gtc 


etc 


tagaaatctc tcttgtgctt 


1097 


Asp 


Ala 


Gly 


Ser 


Cys 


Lys 


Cys 


He 


Val 


Leu 









205 210 

cgtcttgtcg acccttgtcc taagcctcgc tctcgtctgc ctttccatcg tacagtcget 1157 
ctccctgcat ttttgtccct ccactcctcc catgttcta 1196 



<210> 242 
<211> 214 
<212> PRT 

<213> Ustilago maydis 
<400> 242 



Met 


Ala 


Pro 


Ala 


Ala 


He 


Cys 


Arg 


Lys 


Leu. 


Val He Val Gly Asp Gly 


1 








5 










10 


15 


Ala 


Cys 


Gly 


Lys 


Thr 


Ser 


Leu 


Leu 


Cys 


Val 


Phe Ala lie Gly Glu Phe 








20 










25 




30 


Pro 


Gin 


Glu 


Tyr 


Glu 


Pro 


Thr 


He 


Phe 


Glu 


Asn Tyr Val Ala Glu He 






35 










40 






45 


Arg 


Leu 


Asp 


Gly 


Lys 


Pro 


Val 


Gin 


Leu 


Ala 


Leu Trp Asp Thr Ala Gly 




50 










55 








60 


Gin 


Glu 


Glu 


Tyr 


Glu 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser Tyr Ser Gin Ala His 


65 










70 










75 80 


Val 


He 


Leu 


He 


Ala 


Phe 


Ala 


He 


Asp 


Thr 


Pro Asp Ser Leu Glu Asn 










85 










90 


95 


Val 


Gin 


Val 


Lys 


Trp 


Met 


Glu 


Glu 


Val 


Arg 


Gin He Cys Gly Pro Ser 








100 










105 




110 


Val 


Pro 


Val 


Leu 


Leu 


Val 


Gly 


Cys 


Lys 


Lys 


Asp Leu Arg Glu Asn Ala 






115 










120 






125 


He 


Ala 


Lys 


Gly 


Lys 


Pro 


Val 


Gin 


Gly 


His 


Tyr Val Lys Arg Gin Gin 




13 0 










135 








140 


Ala 


Lys 


Leu 


Val 


Ala 


Ala 


Gin 


He 


Gly 


Ala 


Arg Ser Tyr His Glu Cys 


145 










150 










155 160 


Ser 


Ser 


Leu 


Asn 


Asn 


Gin 


Gly 


Val 


Asp 


Ala 


Val Phe Glu Ala Ala Thr 










165 










170 


175 


Arg 


Ala 


Ala 


Met 


Leu 


Val 


Arg. 


Asn 


Ser 


Gly 


Ala Ser Ser Gly Gly Ala 








180 










185 




190 


He 


Ser 


Gin 


Ser 


Lys 


Thr 


Lys 


Glu 


Ala 


Leu 


His Asn Asp Ala Gly Ser 






195 










200 






205 


Cys 


Lys 


Cys 


He 


Val 


Leu 














210 





















<210> 243 
<211> 576 
<212> DNA 

<213> Ustilago maydis 

<220> 

<221> CDS 

<222> (1) . . (576) 

<400> 243 

atg cag acc ate aag tgt gtt gtt gtc gga gat ggt gcg gtc ggt aag 48 
Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
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1 5 10 15 

acc tgc ctg ttg att teg tac acc acc aac aag ttt ccc teg gag tat 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

gtt ccg aca gtg ttt gac aac tac gec gtg act gtc atg att ggc gag 144 
Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Glu 

35 40 45 

gat ccg tac aca etc gga ttg ttc gat acc gec ggt cag gag gac tac 192 
Asp Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gac cga ctg cga ccg ctt tea tac ccg cag acg gat gtc ttc ctg gtc 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 " 70 75 80 

tgc ttc teg gtc acc tea cca gee teg ttc gaa aat gtc aag gaa aag 288 
Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Lys Glu Lys 

85 90 95 

tgg ttc ccg gag gtg cat cac cat tgc cct ggt gtg ccg tgc ctg att 336 
Trp Phe- Pro Glu Val- His - His His Cys Pro Gly Val Pro -Cys Leu He 

100 105 HO 

gtg gga acc cag gtg gat ttg cgc gac gac cac gee gtc ate gag aag 384 
Val Gly Thr Gin Val Asp Leu Arg Asp Asp His Ala Val He Glu Lys 

115 120 125 

ctt gca cgt tea aag cag cgt cct gtg ccc ttt gag gcg ggt gag cgt 432 
Leu Ala Arg Ser Lys Gin Arg Pro Val Pro Phe Glu Ala Gly Glu Arg 

130 " "* 135 140 

ttg gcg aga gag ttg ggt gcg gtc aag tac gtc gag tgc teg gcg ctg 480 
Leu Ala Arg Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 ~ 150 155 160 

acg caa aag gga ttg aag aac gtc ttc gac gag gee ate gtg get gcg 528 
Thr Gin Lys Gly Leu Lys Asn Val Phe top Glu Ala He Val Ala Ala 

165 170 175 

ctg gaa ccg cct gta ate cgc aag aag tec aag tgc gee att etc 573 
Leu Glu Pro Pro Val He Arg Lys Lys Ser Lys Cys Ala He Leu 
180 185 190 

tga 



576 



<210> 244 
<211> 191 
<212> PRT 

<213> Ustilago maydis 
<400> 244 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

! 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Glu 

35 40 45 

Asp Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 * 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 ^ " 70 75 80 

Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Lys Glu Lys 

85 * 90 95 

Trp Phe Pro Glu Val His His His Cys Pro Gly Val Pro Cys Leu He 

100 105 HO 

Val Gly Thr Gin Val Asp Leu Arg Asp Asp His Ala Val He Glu Lys 

115 120 125 

Leu Ala Arg Ser Lys Gin Arg Pro Val Pro Phe Glu Ala Gly Glu Arg 

130 " 135 140 

Leu Ala Arg Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Ala Ala 

165 170 175 

Leu Glu Pro Pro Val He Arg Lys Lys Ser Lys Cys Ala He Leu 
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180 



185/291 
185 
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190 



<210> 245 
<211> 633 
<212> DNA 

<213> Encephalitozoon cuniculi 

<220> 

<221> CDS 

<222> (1) . . (633) 

<400> 245 

atg aag gag aca aac tgg ata atg aac agt gaa aca cac cga tct gga 48 

Met Lys Glu Thr Asn Trp lie Met Asn Ser Glu Thr His Arg Ser Gly 

1 5 10 15 

aaa gtt gtc att gtg gga gat gga gcc tgc ggt aaa aca tgt ctg etc 96 

Lys Val Val He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu 

20 " * 25 " 30* ' ~ * " 

gag gtg ttt aga aga aac cag ttt cct gaa aca tac ate cca acc gtt 144 

Glu Val Phe Arg Arg Asn Gin Phe Pro Glu Thr Tyr He Pro Thr Val 

35 40 45 

gtc gac aac ttc gtg aag gaa gtg aag gtc get gac gac aaa tat gtt 192 
Val Asp Asn Phe Val Lys Glu Val Lys Val Ala Asp Asp Lys Tyr Val 

50 55 60 

tec tta gcc tta tgg gat act gca ggg caa gag gat tat gat aca ate 240 

Ser Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Thr He 
65 70 75 80 

agg cct ctt tea tac aag gag aca gat ctt gta eta ctg tgc tat aca 288 
Arg Pro Leu Ser Tyr Lys Glu Thr Asp Leu Val Leu Leu Cys Tyr Thr 

85 - 90 95 

ata gaa aac aag aag aaa ate cca aac ate tea aga aaa tgg ctg atg 336 

lie Glu Asn Lys Lys Lys lie Pro Asn He Ser Arg Lys Trp Leu Met 

100 ' 105 110 

gag ata aag aac tac tgc ccc teg teg caa ttc ttt ctt ate ggg ttg 384 

Glu He Lys Asn Tyr Cys Pro Ser Ser Gin Phe Phe Leu He Gly Leu 

115 120 125 

aag aag gat ata agg gat atg gat gat ccg aca ate gac aag teg tea 432 

Lys Lys Asp He Arg Asp Met Asp Asp Pro Thr He Asp Lys Ser Ser 

130 135 140 

ata gtc acg gaa age gag ggt egg aag ate gca gat aat ate aat get 480 
He Val Thr Glu Ser Glu Gly Arg Lys He Ala Asp Asn He Asn Ala 
145 150 155 160 

gcg agg ttt ttt gag tgc tct gcc cgt acc aga gaa aac gtg aac ctt 528 

Ala Arg Phe Phe Glu Cys Ser Ala Arg Thr Arg Glu Asn Val Asn Leu 

165 170 175 

gta ttc gtg gaa gcc gca aag tat ata tgg gac acg aag cag get get 576 

Val Phe Val Glu Ala Ala Lys Tyr He Trp Asp Thr Lys Gin Ala Ala 

180 185 190 

tct gat gcc aga teg tgt ggc ttt ttt teg tgc ata aga tgc tgc cgt 624 
Ser Asp Ala Arg Ser Cys Gly Phe Phe Ser Cys He Arg Cys Cys Arg 

195 200 205 
tat aga tag 633 
Tyr Arg 

210 

<210> 246 
<211> 210 
<212> PRT 

<213> Encephalitozoon cuniculi 
<400> 246 

Met Lys Glu Thr Asn Trp He Met Asn Ser Glu Thr His Arg Ser Gly 

1 5 10 15 

Lys Val Val He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu 

20 25 30 

Glu Val Phe Arg Arg Asn Gin Phe Pro Glu Thr Tyr He Pro Thr Val 
35 40 45 
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Val Asp Asn Phe Val Lys Glu Val Lys Val Ala Asp Asp Lys Tyr Val 

50 55 60 

Ser Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Thr He 
65 70 75 80 

Arg Pro Leu Ser Tyr Lys Glu Thr Asp Leu Val Leu Leu Cys Tyr Thr 

85 90 95 

He Glu Asn Lys Lys Lys He Pro Asn He Ser Arg Lys Trp Leu Met 

100 105 HO 

Glu He Lys Asn Tyr Cys Pro Ser Ser Gin. Phe Phe Leu He Gly Leu 

115 120 125 

Lys Lys Asp He Arg Asp Met Asp Asp Pro Thr He Asp Lys Ser Ser 

130 135 140 

He Val Thr Glu Ser Glu Gly Arg Lys He Ala Asp Asn He Asn Ala 
145 150 155 160 

Ala Arg Phe Phe Glu Cys Ser Ala Arg Thr Arg Glu Asn Val Asn Leu 

165 170 175 

Val Phe Val Glu Ala Ala Lys Tyr He Trp Asp Thr Lys Gin Ala Ala 

180 " " 185 190 

Ser Asp Ala Arg Ser Cys Gly Phe Phe Ser Cys He Arg Cys Cys Arg 
195 200 205 

Tyr Arg 
210 

<210> 247 
<211> 588 
<212> DNA 
<213> Mucor rouxii 

<220> 

<221> CDS 

<222> (1) (588) 

atS°gct 4 gaa ate aga cga aaa ctt gtg att gtt gga gat ggt get tgt 48 
Met Ala Glu He Arg Arg Lys Leu Val He Val Gly Asp Gly Ala Cys 
5 10 15 



1 5 J-" 

ggt aaa acc tgt ttg ttg att gtc ttt tea aag ggt act ttt cct gag 
Glv Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Thr Phe Pro Glu 

20 25 30 

ttc tat gtt ccc acc gtt ttt gaa aat tac gta get gat gtc gaa gtc 
Phe Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp Val Glu Val 

35 40 45 

aat gga aaa cac gtg gaa tta get tta tgg gat aca gca ggc caa gaa 
Asp Gly Lys His Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tat gat cgt etc cgt ccc ttg tct tac cct gat tct cat gtt ate 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser His Val He 
65 70 75 80 

ttg att tgt ttt get gtt gat tea ccc gat tea ttg gaa aac gtt caa 
Leu He Cys Phe Ala Val Asp Ser Pro Asp Ser Leu Glu Asn Val Gin 

85 90 95 

gag aag tgg ate tct gaa gta etc cac ttc tgt caa ggt tta cct att 
Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly Leu Pro He 

100 105 HO 

gtt tta gtt ggt tgt aag aaa gat tta aga aat gat cct gga aca att 
Val Leu val Gly Cys Lys Lys Asp Leu Arg Asn Asp Pro Gly Thr He 

115 " 120 125 

gaa gaa ctt aga aga aac tct caa aaa cct gtc agt tea gaa gag ggt 
Glu Glu Leu Arg Arg Asn Ser Gin Lys Pro Val Ser Ser Glu Glu Gly 

130 135 140 

get tct att get caa aga att agt get tac aag tac ctt gaa tgt tct 
Ala Ser He Ala Gin Arg He Ser Ala Tyr Lys Tyr Leu Glu Cys Ser 
145 150 155 160 

gee aag act ggt gaa ggt gta cgt gaa gta ttt gaa cac gca aca aga 
Ala Lys Thr Gly Glu Gly Val Arg Glu Val Phe Glu His Ala Thr Arg 

165 170 175 

get gca tta atg gtt tct aag aag aaa aag tea aag agt ggt gtc tgc 
Ala Ala Leu Met Val Ser Lys Lys Lys Lys Ser Lys Ser Gly Val Cys 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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180 185 190 

aac ctt ttg taa 588 
Asn lieu Leu 
195 

<210> 248 
<211> 195 
<212> PRT 

<213> Mucor rouxii 
<400> 248 

Met Ala Glu lie Arg Arg Lys Leu Val lie Val Gly Asp Gly Ala Cys 

1 5 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Thr Phe Pro Glu 

20 25 30 

Phe Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp Val Glu Val 
" - - 35 " " 40 45 ■ • 

Asp Gly Lys His Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser His Val He 
65 70 75 80 

Leu He Cys Phe Ala Val Asp Ser Pro Asp Ser Leu Glu Asn Val Gin 

85 90 95 

Glu Lys Trp lie Ser Glu Val Leu His Phe Cys Gin Gly Leu Pro He 

100 105 110 

Val Leu Val Gly Cys Lys Lys Asp Leu Arg Asn Asp Pro Gly Thr He 

115 120 125 

Glu Glu Leu Arg Arg Asn Ser Gin Lys Pro Val Ser Ser Glu Glu Gly 

13 0 135 140 

Ala Ser He Ala Gin Arg He Ser Ala Tyr Lys Tyr Leu Glu Cys Ser 
145 150- 155 160 

Ala Lys Thr Gly Glu Gly Val Arg Glu Val Phe Glu His Ala Thr Arg 

165 170 175 

Ala Ala Leu Met Val Ser Lys Lys Lys Lys Ser Lys Ser Gly Val Cys 
180 185 190 

Asn Leu Leu 
195 

<210> 249 
<211> 1014 
<212> DNA 

<213> Neurospora crassa 

<220> 
<221> CDS 

<222> (259) . . (1014) 
<400> 249 

atgtcgtcgt caagcaagtt ccgcagctca caccactatc attcgcagtc ggtgtcgtca 60 

atcctaggca ggcacgatag caccagcccc ggcgcagaac tccaacgacg accgaccacc 120 

acctcgtcct acacatcctc tggtagcgcc agccgacgag taccacgcga caacgggacg 180 

agatcgagcg atgggacagt gagtaccatg atgagcacca catcgtcgac cggaagagaa 240 

tcagcagcaa ctacggcc atg acc gag ggc ccg gcc tac tec aag aag gtg 291 

Met Thr Glu Gly Pro Ala Tyr Ser Lys Lys Val 
1 5 10 

gtg gtc gtg ggc gat ggc ggt tgc gga aag aca tgt etc ctg ate agt 339 

Val Val Val Gly Asp Gly Gly Cys Gly Lys Thr Cys Leu Leu He Ser 

15 20 25 

tat agt cag gga tac ttc cca gag aaa tat gtc cca acc gtc ttt gag . 387 



675 
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Tyr Ser Gin Gly Tyr Phe Pro Glu Lys Tyr Val Pro Thr Val Phe Glu 

30 35 40 

aac tac ate acc tac cca acg cat cca ccg acc ggt aag acc gtc gag 435 
Asn Tyr He Thr" Tyr Pro Thr His Pro Pro Thr Gly Lys Thr Val Glu 

45 50 55 

etc gec ctg tgg gat acc gec ggc caa gag gaa tac gac cgc ttg cga 483 
Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr Asp Arg Leu Arg 
60 65 70 75 

ccg ctt tea tac cca gaa acc gac ctt att. ttt gtc tgc ttc gec att 531 
Pro Leu Ser Tyr Pro Glu Thr Asp Leu He Phe Val Cys Phe Ala He 

80 85 90 

gac tgc ccc aac tec etc gag aat gtc atg gac aag tgg tac ccc gaa 579 
Asp Cys Pro Asn Ser Leu Glu Asn Val Met Asp Lys Trp Tyr Pro Glu 

95 100 105 

gtc etc cac ttc tgt ccg tat aca ccc ctt ate etc gtc ggc etc aag 627 
Val Leu His Phe Cys Pro Tyr Thr Pro Leu He Leu Val Gly Leu Lys 

110 115 . _ 120 

tec gac etc cgc aat' aag aag acg tgc ate gac atg etc aag aca caa 
Ser Asp Leu Arg Asn Lys Lys Thr Cys He Asp Met Leu Lys Thr Gin 

125 130 135 

ggt etc acc ccc gtc acc acc gaa caa gga etc gec gtc get aag aag 723 
Gly Leu Thr Pro Val Thr Thr Glu Gin Gly Leu Ala Val Ala Lys Lys 
140 145 150 155 

atg ggc get cag tac atg gag tgc tea tea aag gag atg aag ggt gta 771 
Met Gly Ala Gin Tyr Met Glu Cys Ser Ser Lys Glu Met Lys Gly Val 

160 165 170 

gag gag att ttt gag cag gee ate etc aca gta gtc gec aac gac agg 819 
Glu Glu lie Phe Glu Gin Ala He Leu Thr Val Val Ala Asn Asp Arg 

175 180 185 

aaa aca ctg gaa cag gaa gee gcg aac ggc atg ctg ggt gtt ggc gcg 867 
Lys Thr Leu Glu Gin Glu Ala Ala Asn Gly Met Leu Gly Val Gly Ala 

190 195 200 

ggc teg gga age gga aag ggc age gga ate teg ttc age agt ggt gac 915 
Gly Ser Gly Ser Gly Lys Gly Ser Gly He Ser Phe Ser Ser Gly Asp 

205 210 215 

aag gee ggt tec ggg ata ggg ccc gtc aag gcg gec ggt gtg ggt ggc 963 
Lys Ala Gly Ser Gly He Gly Pro Val Lys Ala Ala Gly Val Gly Gly 
220 ' 225 230 235 

acg att gtc ccg aaa acg agg aag aag aag aga aag tgt ggt atg atg 1011 
Thr He Val Pro Lys Thr Arg Lys Lys Lys Arg Lys Cys Gly Met Met 
240 245 250 

tga 



1014 



<210> 250 
<211> 251 
<212> PRT 

<213> Neurospora crassa 



<400> 250 



Met 


Thr 


Glu 


Gly 


Pro 


Ala 


Tyr Ser Lys 


Lys 


Val Val 


Val Val Gly Asp 


1 






5 






10 




15 


Gly 


Gly 


Cys 


Gly 


Lys 


Thr 


Cys Leu Leu 


He 


Ser Tyr Ser Gin Gly Tyr 




20 






25 






30 


Phe 


Pro 


Glu 


Lys 


Tyr 


Val 


Pro Thr Val 


Phe 


Glu Asn 


Tyr He Thr Tyr 






35 




40 






45 


Pro 


Thr 


His 


Pro 


Pro 


Thr 


Gly Lys Thr 


Val 


Glu Leu 


Ala Leu Trp Asp 




50 










55 




60 




Thr 


Ala 


Gly 


Gin 


Glu 


Glu 


Tyr Asp Arg 


Leu Arg Pro 


Leu Ser Tyr Pro 


65 








70 






75 


80 


Glu 


Thr 


Asp 


Leu 


He 


Phe 


Val Cys Phe 


Ala 


He Asp 


Cys Pro Asn Ser 








85 






90 




95 


Leu 


Glu 


Asn 


Val 


Met 


Asp Lys Trp Tyr 


Pro 


Glu Val 


Leu His Phe Cys 








100 






105 






110 


Pro 


Tyr 


Thr 


Pro 


Leu 


He Leu Val Gly Leu Lys Ser Asp Leu Arg Asn 




115 








120 






125 
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Lys 


Lys 


Thr Cys 


lie 


Asp 


Met 


Leu 


Lys 


Thr 


Gin 


Gly 


Leu 


Thr 


Pro 


Val 


130 








135 










140 










Thr 


Thr 


Glu Gin 


Gly 


Leu 


Ala 


Val 


Ala 


Lys 


Lys 


Met 


Gly 


Ala 


Gin 


Tyr 


145 








150 










155 










160 


Met 


Glu 


Cys Ser 


Ser 
165 


Lys 


Glu 


Met 


Lys 


Gly 
170 


Val 


Glu 


Glu 


He 


Phe 
175 


Glu 


Gin 


Ala 


lie Leu 
180 


Thr 


Val 


Val 


Ala 


Asn 
185 


Asp 


Arg 


•Lys 


Thr 


Leu 
190 


Glu 


Gin 


Glu 


Ala 


Ala Asn 
195 


Gly 


Met 


Leu 


Gly 
200 


Val 


Gly. 


Ala 


Gly 


Ser 
205 


Gly 


Ser 


Gly 


Lys 


Gly 
210 


Ser Gly 


lie 


Ser 


Phe 
215 


Ser 


Ser 


Gly 


Asp 


Lys 
220 


Ala 


Gly 


Ser 


Gly 


lie 


Gly 


Pro Val 


Lys 


Ala 


Ala 


Gly 


Val 


Gly 


Gly 


Thr 


lie 


Val 


Pro 


Lys 


225 








23 0 










235 










240 


Thr 


Arg 


Lys Lys 


Lys 


Arg 


Lys 


Cys 


Gly 


Met 


Met 
















245 










250 















<210> 251 
<211> 915 
<212> DNA 

<213> Oryza sativa 

<220> 

<221> CDS 

<222> (85) . . (678) 

<400> 251 

gggaccatca ctaccaccac ttcaccacca aacccaacca agaacaagtt ttcaggggca 60 



aattaagaag agcttggtga cgat atg age acg gcg agg ttc ate aag tgc 111 

Met Ser Thr Ala Arg Phe He Lys Cys 
1 5 



gtc 


acc 


gtc 


ggc 


gac 


ggc 


gec 


gtc 


ggc 


aag 


aca 


tgc 


atg 


etc 


ate 


tec 


159 


Val 


Thr 


Val 


Gly 


Asp 


Gly 


Ala 


val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


lie 


Ser 




10 










15 










20 










25 




tac 


acc 


age 


aac 


act 


ttc 


ccc 


acg 


gat 


tat 


gtg 


ccg 


acg 


gtt 


ttc 


gac 


207 


Tyr 


Thr 


Ser 


Asn 


Thr 


Phe 


Pro 


Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 










30 










35 










40 






aac 


ttc 


age 


gee 


aac 


gtg 


gtc 


gtc 


gac 


ggc 


aac 


acg 


gtg 


aac 


Ctt 


ggg 


255 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


Val 


Val 


Asp 


Gly 


Asn 


Thr 


Val 


Asn 


Leu 


Gly 










45 










50 










55 








etc 


tgg 


gat 


act 


get 


gga 


caa 


gag 


gat 


tac 


aat 


agg 


ctg 


agg 


cct 


etc 


303 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 








60 










65 










70 










age 


tac 


aga 


gga 


get 


gat 


gtc 


ttt 


tta 


eta 


gca 


ttt 


tec 


etc 


ate 


age 


351 


Ser 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


Leu 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 






75 










80 










85 












aaa 


gcg 


age 


tat 


gag 


aat 


att 


cae 


aag 


aag 


tgg 


ata 


cca 


gag 


ctg 


agg 


399 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


He 


His 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Arg 




90 










95 










100 










105 




cat 


tat 


get 


cct 


aat 


gtg 


cca 


ate 


gta 


tta 


gtt 


gga 


acc 


aag 


ctt 


gac 


447 


His 


Tyr 


Ala 


Pro 


Asn 


Val 


Pro 


He 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 












110 










115 










120 






ttg 


cgt 


gag 


gac 


aag 


cag 


ttc 


ttc 


ctg 


gac 


cac 


cct 


ggt 


eta 


gca 


cct 


495 


Leu 


Arg 


Glu 


Asp 


Lys 


Gin 


Phe 


Phe 


Leu 


Asp 


His 


Pro 


Gly 


Leu 


Ala 


Pro 










125 










130 










135 








att 


tec 


act 


gca 


cag 


ggg 


gag 


gag 


ctg 


aag 


agg 


atg 


ata 


ggt 


get 


gcg 


543 


He 


Ser 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 


Lys 


Arg 


Met 


He 


Gly 


Ala 


Ala 








140 










145 










150 










gcg 


tac 


ate 


gaa 


tgc 


age 


tec 


aag 


acg 


cag 


cag 


aat 


gtg 


aaa 


tea 


gta 


591 


Ala 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 


Lys 


Thr 


Gin 


Gin 


Asn 


Val 


Lys 


Ser 


Val 






155 










160 










165 












ttc 


gac 


teg 


gcg 


ate 


aaa 


gtt 


gta 


ctt 


tgc 


ccg 


cca 


aag 


ccg 


aag 


aag 


639 


Phe 


Asp 


Ser 


Ala 


He 


Lys 


Val 


Val 


Leu 


Cys 


Pro 


Pro 


Lys 


Pro 


Lys 


Lys 




170 










175 










180 










185 




aag 


aac 


acc 


agg 


aag 


cag 


agg 


agt 


tgc 


tgg 


ate 


eta 


tgagtgataa 




685 


Lys 


Asn 


Thr 


Arg 


Lys 


Gin 


Arg 


Ser 


Cys 


Trp 


He 


Leu 
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190 195 
acagcaagat tggcgtattc cgcaggaaac ctgtattatt tttacagtgt gtgttgtgtt 



atgtttgtgt gttgcagatt agatgcagtt tgggttacaa aatttgctgg gtctgaaata 
ttcagctcag tgcttcttga gtactacctc tgatgattct tacaaaagtg ttctttggtt 
ggttcagata tatatattat gattattata aaaaaaaaaa aaaaaaaaaa 



745 



805 
865 
915 



<210> 252 

<211> 197 

<212> PRT 

<213> Oryza sativa 

Met°Ser 5 Thr Ala Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 

85 90 - 95 

His Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Asn Val Pro 

100 105 HO 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys Gin Phe 

115 " 120 125 

Phe Leu Asp His Pro Gly Leu Ala Pro He Ser Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Lys Arg Met He Gly Ala Ala Ala Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys "Ser Val Phe Asp Ser Ala He Lys Val 

165 170 175 

Val Leu Cys Pro Pro Lys Pro Lys Lys Lys Asn Thr Arg Lys Gin Arg 

180 185 190 

Ser Cys Trp He Leu 
195 

<210> 253 
<211> 1010 
<212> DNA 
<:213> Zea mays 

<220> 

<221> CDS 

<222> (74).. (718) 

<400> 253 , 
cactccgtcc ctctcctcca ggaccgcccg ctcttatccc gtttctccct tctctgggct 60 



109 



cggagaatcg gag atg gcg tec age gec tct egg ttc ate aag tge gtc 
eggaga g a a a s^a ^ ^ ^ ^ ^ phe Jle Ly£j ^ Val 

15 10 
acg gte ggc gac ggt gec gtg ggc aag aca tgt atg etc ate tge tac 157 
Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr 

IS 20 -25 

ace age aac aag ttc ccc act gat tac ata ccc aeg gtg ttc gac aat 



205 
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Thr 


Ser 


As el 


Lys 


Phe 


Pro 


Thr 


Asp 


Tyr 


He 


Pro 


Thr 


Val 


Phe 


Asp 


Asn . 






30 










35 










40 










ttc 


agt 


gca 


aat 


gtc 


gtt 


gtg 


gat 


ggc 


acc 


acg 


gtg 


aat 


ttg 


ggc 


ctg 


253 


Phe 


Ser 


Ala 


Asn 


Val 


Val 


Val 


Asp 


Gly 


Thr 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 




45 










50 










55 










60 




tgg 


gat 


acc 


gca 


ggg 


cag 


gaa 


gat 


tac 


aac 


cga 


ctg 


agg 


cct 


eta 


age. 


301 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 












65 










70 










75 






tac 


cgt 


ggt 


gca 


gat 


gtt 


ttc 


gtg 


ctt 


gca 


ttc 


tea 


ctt 


gtg 


agt 


cga 


349 


Tyr 


Arg 


Gly 


Ala 


Asp 


Val 


Phe 


Val 


Leu 


Ala 


Phe 


Ser 


Leu 


Val 


Ser 


Arg 










80 










85 










90 








get 


age 


tat 


gag 


aat 


ate 


atg 


aag 


aag 


tgg 


ata 


ccg 


gag 


ctt 


cag 


cat 


397 


Ala 


Ser 


Tyr 


Glu 


Asn 


lie 


Met 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Gin 


His 








95 










100 










105 










tat 


gca 


cgt 


ggg 


gtg 


cct 


gtt 


gtg 


ttg 


gta 


ggc 


aca 


aaa 


ttt 


gat 


ctt 


445 


Tyr 


Ala 


Arg 


Gly 


Val 


Pro 


Val 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Phe 


Asp 


Leu 






110 










115 










120 












cgt" 


gaa 


gac 


aag 


cac 


tac 


ttg 


atg 


gac 


cat 


cct 


ggg 


ttg 


gtg 


cct 


gtt 


493 


Arg 


Glu 


Asp 


Lys 


His 


Tyr 


Leu 


Met 


Asp 


His 


Pro 


Gly 


Leu 


Val 


Pro 


Val 




125 










130 










135 










140 




acc 


aca 


gca 


caa 


ggg 


gag 


gaa 


ctt 


cgt 


aga 


caa 


att 


ggt 


get 


atg 


tat 


541 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 


Arg 


Arg 


Gin 


He 


Gly 


Ala 


Met 


Tyr 












145 










150 










155 






tac 


ate 


gaa 


tgc 


agt 


teg 


aag 


aca 


cag 


cag 


aat 


gtc 


aaa 


get 


gtg 


ttc 


589 


Tyr 


lie 


Glu 


Cys 


Ser 


Ser 


Lys 


Thr 


Gin 


Gin 


Asn 


Val 


Lys 


Ala 


Val 


Phe 










160 










165 










170 








gat 


get 


gec 


ate 


aag 


gtt 


gta 


ate 


cag 


cct 


cca 


act 


aaa 


eta 


aga 


gaa 


637 


Asp 


Ala 


Ala 


lie 


Lys 


Val 


Val 


He 


Gin 


Pro 


Pro 


Thr 


Lys 


Leu 


Arg 


Glu 








175 










180 










185 










aag 


aag 


aaa 


aag 


aaa 


tea 


cgc 


aag 


gga 


tgt 


teg 


atg 


gtg 


aac 


ate 


tta 


685 


Lys 


Lys 


Lys 


Lys 


Lys 


Ser 


Arg 


Lys 


Gly 


Cys 


Ser 


Met 


Val 


Asn 


He 


Leu 






190 










195 










200 












tct 


gga 


aga 


aaa 


atg 


eta 


tgc 


ttc 


aag 


tec 


tgaatgatcg aagggggtct 


735 


Ser 


Gly 


Arg 


Lys 


Met 


Leu 


Cys 


Phe 


Lys 


Ser 

















205 210 

tacctgaact aataccatga gtgtgacccc aagttcgcga agcttgaaat ettgatgege 795 

tegttgegea tgtatatttg cacctttggt tattaatcac tagaggtaga tgattgaaac 855 

taatctgett aaccaatgtg cactgctggg cgctggcgtg gttagctata tcagttaggc 915 

agttcgacag aggtcaaaga gattgagatt ttgttctttc ttggcaatgt cacagctttt 975 

gtgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 1010 



<210> 254 

<211> 214 

<212> PRT 

<213> Zea mays 

<400> 254 

Met Ala Ser Ser Ala Ser Arg Phe 
1 5 
Gly Ala Val Gly Lys Thr Cys Met 
20 

Phe Pro Thr Asp Tyr He Pro Thr 

35 40 
Val Val Val Asp Gly Thr Thr Val 

50 55 
Gly Gin Glu Asp Tyr Asn Arg Leu 
65 70 



He Lys Cys Val Thr Val Gly Asp 

10 15 
Leu He Cys Tyr Thr Ser Asn Lys 
25 30 
Val Phe Asp Asn Phe Ser Ala Asn 
45 

Asn Leu Gly Leu Trp Asp Thr Ala 
60 

Arg Pro Leu Ser Tyr Arg Gly Ala 
75 80 
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Aso Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser Tyr Glu 

* 85 90 95 

Asn lie Met Lys Lys Trp He Pro Glu Leu Gin His Tyr Ala Arg Gly 

100 105 . 110 

Val Pro Val Val Leu Val Gly Thr Lys Phe Asp Leu Arg Glu Asp Lys 

115 120 125 

His Tyr Leu Met Asp His Pro Gly Leu Val Pro Val Thr Thr Ala Gin 

130 135 140 

Gly Glu Glu Leu Arg Arg Gin He Gly Ala. Met Tyr Tyr He Glu Cys 
145 150 155 160 

Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He 

165 170 175 

Lvs Val Val He Gin Pro Pro Thr Lys Leu Arg Glu Lys Lys Lys Lys 

180 185 190 

Lys Ser Arg Lys Gly Cys Ser Met Val Asn He Leu Ser Gly Arg Lys 
195 200 205 

Met Leu Cys Phe Lys Ser 

210 

<210> 255 
<211> 579 
<212> DNA 

<213> Schizophyllum commune 

<220> 
<221> CDS 
<222> (1) . . (579) 

<400> 255 AQ 
atg cag acc ate aag tgt gta gtt gtt gga gac ggt gcg gtt gga aag 48 
Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

acg tgc ctg ttg ate tec tat acg acc aac aag ttc ccg age gag tat 96 
Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

qtg ccc acc gtc ttc gac aat tat gee gtg acc gtc atg ate ggc gag 
val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Glu 

35 40 45 

aac ccc tat aca ctg ggt ctg ttt gat act get ggc cag gag gat tac 192 
Asp Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gat cga etc egg ccg ttg teg tac ccg cag acg gac gtg ttt ctt gtg 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

tgc ttt agt gta aca tct cca gee tec ttc gag aac gta aag gag aaa 288 
Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Lys Glu Lys 

85 90 95 

tgg ttc ccg gag gtt cgc cac cac tgt ccg ggc gtg ccc tgc etc ate 336 
Trp Phe Pro Glu Val Arg His His Cys Pro Gly Val Pro Cys Leu He 

100 105 HO 

gtc ggc acg caa ate gac ttg agg gac gac teg cag gtg ate gag aag 384 
Val Gly Thr Gin He Asp Leu Arg Asp Asp Ser Gin Val He Glu Lys 

115 ~ 120 125 

ctg gcg egg caa aag cag aga ccg gtc acg age gac caa ggc gag egg 432 
Leu Ala Arg Gin Lys Gin Arg Pro Val Thr Ser Asp Gin Gly Glu Arg 

130 135 140 

etc gtt egg gaa ctt ggc gcg gtc aag tac gtc gag tgc tea gca etc 480 
Leu Val Arg Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 150 155 160 

acg cag aag gga ttg aag aac gtg ttt gat gag gee ate gtc gec gcg 528 
Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Ala Ala 

165 170 175 

ctg gag cca ccc gtg gtg aag aag aag ggg ccg aaa tgt gtc ate ctt 576 
Leu Glu Pro Pro Val Val Lys Lys Lys Gly Pro Lys Cys Val He Leu 
180 185 190 



144 



tga 



579 
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<210> 256 
<211> 192 
<212> PRT 

<213> Schizophyllum commune 



<400> 256 



Met 


Gin 


Thr 


lie 


Lys 


Cys 


Val 


Val 


Val 


Gly. Asp 


Gly Ala Val Gly 


Lys 


1 








5 










10 






15 




Thr 


Cys 


Leu 


Leu 


lie 


Ser 


Tyr 


Thr 


Thr 


Asn Lys 


Phe 


Pro 


Ser Glu 


Tyr 








20 










25 








30 




val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Tyr 


Ala 


Val Thr 


Val 


Met 


He Gly 


Glu 






35 










40 








45 






Asp 


Pro 


Tyr 


Thr 


Leu Gly 


Leu 


Phe 


Asp 


Thr Ala 


Gly Gin Glu Asp 


Tyr 




50 










55 








60 








Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin Thr 


Asp 


Val 


Phe Leu 


Val 


65 










'70 








75 








80 


Cys 


Phe 


Ser 


Val 


Thr 


Ser 


Pro 


Ala 


Ser 


Phe Glu 


Asn 


Val 


Lys Glu 


Lys 










85 










90 






95 


Trp 


Phe 


Pro 


Glu 


Val 


Arg 


His 


His 


Cys 


Pro Gly 


Val 


Pro 


Cys Leu 


He 








100 










105 








110 




Val 


Gly 


Thr 


Gin 


He Asp 


Leu 


Arg 


Asp 


Asp Ser 


Gin 


Val 


He Glu 


Lys 






115 










120 








125 






Leu 


Ala 


Arg 


Gin 


Lys 


Gin 


Arg 


Pro 


Val 


Thr Ser 


Asp Gin Gly Glu 


Arg 




130 










135 








140 








Leu 


Val 


Arg 


Glu 


Leu Gly 


Ala 


Val 


Lys 


Tyr Val 


Glu 


Cys 


Ser Ala 


Leu 


145 










150 








155 








160 


Thr 


Gin 


Lys 


Gly 


Leu 


Lys 


Asn 


Val 


Phe 


Asp Glu 


Ala 


He 


Val Ala 


Ala 










165 










170 






175 




Leu 


Glu 


Pro 


Pro 


Val 


Val 


Lys 


Lys 


Lys 


Gly Pro 


Lys 


Cys 


Val He 


Leu 



180 185 190 

<210> 257 
<211> 1075 
<212> DNA 

<213> Aspergillus fumigatus 

<220> 

<221> CDS 

<222> (197) . . (793) 

<400> 257 

ttaacctcca aaagatatac ccttaaagat tattatcgcc ccatccgcct tcacttgacg -60 



ctaaaccccc ccaagaattc acttcgacgc taattgatcg ctatcatttg ggttcatctt 120 



tacaagctgt tcatcatcaa atcctctctg tgattttctt ctctgtgcat atattcttaa 180 



ttccaacaac aaaaac atg gcg tec ggc ccc get act cag tea tta aag tgt 232 
Met Ala Ser Gly Pro Ala Thr Gin Ser Leu Lys Cys 











1 








5 










10 






gtg 


gtg 


aca 


ggt 


gat 


ggt 


gca 


gtt 


gga 


aag 


aca 


tgc 


etc 


etc 


ate 


teg 


280 


val 


Val 


Thr 
15 


Gly 


Asp 


Gly 


Ala 


Val 
20 


Gly 


Lys 


Thr 


Cys 


Leu 
25 


Leu 


He 


Ser 




tat 


ace 


ace 


aat 


gca 


ttc 


ccc 


gga 


gaa 


tac 


ate 


cct 


ace 


gta 


ttt 


gac 


328 


Tyr 


Thr 
30 


Thr 


Asn 


Ala 


Phe 


Pro 
35 


Gly 


Glu 


Tyr 


He 


Pro 
40 


Thr 


Val 


Phe 


Asp 




aae 


tat 


acg 


get 


agt 


gtg 


atg 


gta 


gat 


ggt 


agg 


ccc 


ate 


age 


ttg 


gga 


376 


Asn 


Tyr 


Thr 


Ala 


Ser 


Val 


Met 


Val 


Asp 


Gly 


Arg 


Pro 


He 


Ser 


Leu 


Gly 




45 










50 










55 










60 




ctg 


tgg 


gat 


act 


get 


ggg 


caa 


gaa 


gat 


tat 


gac 


cga 


ctg 


aga 


ccg 


ctg 


424 


Leu 


Trp 


Asp 


Thr 


Ala 
65 


Gly 


Gin 


Glu 


Asp 


Tyr 
70 


Asp 


Arg 


Leu 


Arg 


Pro 
75 


Leu 




tec 


tac 


cct 


caa 


ace 


gac 


gtc 


ttc 


etc 


ate 


tgc 


ttc 


tec 


att 


gtc 


age 


472 



He 


Cys Phe 


Ser 


He 


Val 


Ser 








on 








aag 


tgg tac 


ccg 


g a g 


CLUw 


gag 




Lys 


Trp Tyr 


Pro 


Glu 


He 


Glu 




ctt 


gtt ggt 


acc 


aaa 




y cll. 


-J O O 


Leu 


val Giy 

120 


rpVi v 

inr 


Lys 


Leu 


A an 
Asp 




tec 


etc egg 


cag 


aaa 


aarr 


a frr 


U X w 


Ser' 


Leu Arg 
135 


Gin- 


Lys 


Lys 


Met 
140 




teg 


cca aaa 


gag 


ate 


cga 


gcg 


c a a 


Ser 


Pro Lys 


Glu 


He 


Arg 


Ala 




150 








155 






agg 


cag cgc 


aac 


ttc 


aaa 


age 


712 


Arg 


Gin Arg 


Asn 


Phe 
170 


Lys 


Ser 




etc 


aac cct- 


gga 


cct 


gee 


gca~ 


760 


Leu 


Asn Pro 


Gly 
185 


Pro 


Ala 


Ala 
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Ser Tyr Pro Gin Thr Asp Val Phe Leu 

80 85 
cct cct teg ttt gac aac gtc aag gca 
Pro Pro Ser Phe Asp Asn Val Lys Ala 

95 100 
cac cat gee ccc aat gtt ccc ate ate- 
His His Ala Pro Asn Val Pro He He 

110 115 
ctg aga gac gat ccc gcg aca gca gaa 
Leu Arg Asp Asp Pro Ala Thr Ala Glu 
125 " " 130 
gac ctg tct cgt acg aga cac tgg ccg 
Asp Leu Ser Arg Thr Arg His Trp Pro 
145 

cac aag tat etc gaa tgt tct gee etc 
His Lys Tyr Leu Glu Cys Ser Ala Leu 
160 165 
■ gtc ttt gat gaa gee att cgt get gtc 
Val Phe Asp Glu Ala He Arg Ala Val 

175 .180 
aaa ccg aag age aag aaa tgc acc ata ctg tagaccatta ctttcagctt 810 
Lys Pro Lys Ser Lys Lys Cys Thr He Leu 

190 195 
ttcatcatet aatcataaac aactaattcg gcgtctggga tagttgaagg ttttgcaatg 870 

atccccttga tgattgttcc aactgtgttc atatttcttt ctctctttct eggtcacatg 930 

tcatgatcct gatgagcttc tttctgggtc aggacacccc tttcctcatc ttgtcgtctt ^90 

tttgtgcaca tgcagcataa acttgtaaca tgctacgcca taacaggaag ctgtcactta 1050 

atcgatttaa cctccattgt cctaa 1075 



<210> 258 
<211> 198 
<212> PRT 

<213> Aspergillus fumigatus 
<400> 258 

Met Ala Ser Gly Pro Ala Thr Gin Ser Leu Lys Cys Val Val Thr Gly 

15 io 15 

Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn 

20 25 3 0 

Ala Phe Pro Gly Glu Tyr He Pro Thr Val Phe Asp Asn Tyr Thr Ala 

35 40 45 

Ser Val Met Val Asp Gly Arg Pro He Ser Leu Gly Leu Trp Asp Thr 

50 55 60 

Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin 
65 70 75 80 

Thr Asp Val Phe Leu He Cys Phe Ser He Val Ser Pro Pro Ser Phe 

85 90 95 

Asp Asn Val Lys Ala Lys Trp Tyr Pro Glu He Glu His His Ala Pro 

100 105 HO 

Asn Val Pro He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp 

115 120 125 

Pro Ala Thr Ala Glu Ser Leu Arg Gin Lys Lys Met Asp Leu Ser Arg 

130 135 140 

Thr Arg His Trp Pro Ser Pro Lys Glu He Arg Ala His Lys Tyr Leu 
145 150 155 160 

Glu Cys Ser Ala Leu Arg Gin Arg Asn Phe Lys Ser Val Phe Asp Glu 
165 170 175 
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Ala lie Arg Ala Val Leu Asn Pro Gly Pro Ala Ala Lys Pro Lys Ser 

180 185 190 

Lys Lys Cys Thr lie Leu 
195 

<210> 259 
<211> 594 
<212> DNA 

<213> Aspergillus fumigatus 

<220> 

<221> CDS 

<222> (1) • . (594) 

<400> 259 



atg 


gag 


ctg 


tgc 


ggg 


cgc 


cag 


aaa gtt 


gtc 


cag 


cgc 


aag 


atg 


gta 


etc 


48 


Met 


Glu 


Leu 


Cys 


Gly 


Arg 


Gin Lys Val 


Val 


Gin Arg 


Lys 


Met 


Val 


Leu 




i 


"* 






5 








10 










15" 






tta 


gga 


gat 


ggt 


get 


tgc 


ggc 


aag acg 


tea 


get 


ttg 


aat 


gtg 


ttt 


aca 


96 


Leu 


Gly 


Asp 


Gly Ala 


Cys 


Gly Lys Thr 


Ser 


Ala 


Leu 


Asn 


Val 


Phe 


Thr 










20 








25 










30 








aga 


ggg 


ttc 


ttc 


ccg 


aca 


gtc 


tat gag 


ccg 


act 


gtt 


ttt 


gaa aac 


tat 


144 


Arg 


Gly 


Phe 


Phe 


Pro 


Thr 


Val 


Tyr Glu 


Pro 


Thr 


Val 


Phe 


Glu Ash Tyr 








35 










40 








45 










gtc 


cat 


gac 


att 


ttc 


gtc 


gat 


aac gta 


cac 


atg 


gag 


ttg 


teg 


ctg 


tgg 


192 


Val 


His 


Asp 


He 


Phe 


Val 


Asp 


Asn Val 


His 


Met 


Glu 


Leu 


Ser 


Leu Trp 






50 










55 








60 












gat 


aca 


gec 


ggt 


caa 


gaa 


gaa 


ttc gat 


cga 


tta 


cga 


gca 


ctg 


tec 


tac 


240 


Asp 


Thr 


Ala 


Gly Gin 


Glu 


Glu Phe Asp 


Arg 


Leu Arg 


Ala 


Leu 


Ser 


Tyr 




65 










70 








75 










80 ° 




gag 


gat 


aca 


cat 


gtt 


att 


atg 


eta tgt 


ttc 


age 


gtc 


gat 


age 


cct 


gac 


288 


Glu 


Asp 


Thr 


His 


Val 


He 


Met 


Leu Cys 


Phe 


Ser 


Val 


Asp 


Ser 


Pro Asp 












85 








90 










95 






teg 


ttc 


gaa 


aat 


gtg 


gcg 


acg 


aaa tgg 


att 


gat 


gag 


att 


cgc 


gag 


aat 


336 


Ser 


Phe 


Glu 


Asn 
ioo 


Val 


Ala 


Thr 


Lys Trp 
105 


He 


Asp 


Glu 


He 


Arg 
110 


Glu 


Asn 




tgc 


ccc 


ggc 


gtg 


aag 


tta 


gtc 


etc acg 


gca 


etc 


aaa 


tgc 


gat 


ctg 


cga 


384 


Cys 


Pro 


Gly 


Val 


Lys 


Leu 


Val 


Leu Thr 


Ala 


Leu Lys 


Cys 


Asp 


Leu Arg 








115 










120 








125 










aaa 


gac 


gac 


gag 


ttg 


aac 


gac 


aac ccg 


aac 


gee 


ate 


acg 


ttc 


gaa 


caa 


432 


Lys 


Asp 
130 


Asp 


Glu 


Leu 


Asn 


Asp 
135 


Asn Pro 


Asn 


Ala 


He 
140 


Thr 


Phe 


Glu 


Gin 




gga 


tta 


gcg 


aaa 


gca 


aag 


gaa 


ate ggc 


get 


gta 


aaa 


tac 


ctt 


gaa 


tgc 


480 


Gly 


Leu 


Ala 


Lys Ala 


Lys 


Glu 


He Gly 


Ala 


Val 


Lys 


Tyr 


Leu Glu Cys 




145 










150 








155 










160 




tct 


get 


gtt 


cag 


aat 


cgc 


ggt 


ate agg 


gag 


acc 


ttt 


tat 


gaa 


gee 


gee 


528 


Ser 


Ala 


Val 


Gin 


Asn 


Arg 


Gly 


He Arg 


Glu 


Thr 


Phe 


Tyr 


Glu Ala 


Ala 












165 








170 










175 






aag 


gtc 


get 


ctt 


gat 


gtg 


aag 


cct gca 


gga 


tec 


age 


ggg 


tec 


aag 


gga 


576 


Lys 


Val 


Ala 


Leu Asp 


Val 


Lys 


Pro Ala 


Gly 


Ser 


Ser 


Gly 


Ser Lys Gly 










180 








185 










190 








cag 


tgc 


att 


•ate 


etc 


tga 




















594 


Gin 


Cys 


lie 
195 


He 


Leu 

























<210> 260 
<211> 197 
<212> PRT 

<213> Aspergillus fumigatus 
<400> 260 

Met Glu Leu Cys Gly Arg Gin ; Lys Val Val Gin Arg Lys Met Val Leu 

15 10 15 

Leu Gly Asp Gly Ala Cys Gly Lys Thr Ser Ala Leu Asn Val Phe Thr 

20 25 .30 

Arg Gly Phe Phe Pro Thr Val Tyr Glu Pro Thr Val Phe Glu Asn Tyr 
35 40 45 
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Val His Asp He Phe Val Asp Asn Val His Met Glu Leu Ser Leu Trp 

50 ~ 55 60 

Asp Thar Ala Gly Gin Glu Glu Phe Asp Arg Leu Arg Ala Leu Ser Tyr 
65 70 75 80 

Glu Asp Thr His Val He Met Leu Cys Phe Ser Val Asp Ser Pro Asp 

85 90 95 

Ser Phe Glu Asn Val Ala Thr Lys Trp He Asp Glu He Arg Glu Asn 

100 105 HO 

Cys Pro Gly Val Lys Leu Val Leu Thr Ala Leu Lys Cys Asp Leu Arg 

115 120 125 

Lys Asp Asp Glu Leu Asn Asp Asn Pro Asn Ala He Thr Phe Glu Gin 

130 135 140 

Gly Leu Ala Lys Ala Lys Glu He Gly Ala Val Lys Tyr Leu Glu Cys 
145 150 155 160 

Ser Ala Val Gin Asn Arg Gly He Arg Glu Thr Phe Tyr Glu Ala Ala 

165 170 175 

Lys Val Ala Leu Asp Val Lys Pro Ala Gly Ser Ser Gly Ser Lys Gly 

180 I85 190 

Gin Cys He He Leu 
195 

<210> 261 

<211> 582 

<212> DNA 

<213> Mus musculus 

<220> 
<221> CDS 

<222> (1) . . (582) j, 

atg°gct 6 gcc ate egg aag aaa ctg gtg ate gtg gga gat gga get tgt •. 48 

Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 10 I 5 

gga aaa aca tgt ttg etc ate gtc ttc age aag gac cag ttt cct gag 96 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 3 0 

gtt tac gtg ccc aca gta ttt gag aac tat gtg get gat ate gaa gtg 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

qat qga aaa cag gtg gag ttg gee ctg tgg gat aca get gga caa gaa 
Asp Ily Lys Gil Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tat gat cgc ctg agg cca etc tec tat ccc gac act gat gtt etc 
Isp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val Leu 



65' - 70 75 80 

ttg tta tgt ttc tec att ggc aac cct gat age ttt ggg aac ate cca 
Leu Leu Cys Phe Ser He Gly Asn Pro Asp Ser Phe Gly Asn He Pro 

85 90 95 

cat aaa tgg att cca gaa gtc aag cat ttc tgt ccc aac gtg ccc ate 
His Lys Trp He Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 HO 

ate ctg gtt ggg agt aag aag gat ctt egg aat gac ttc tac acg ata 
He Leu val Gly Ser Lys Lys Asp Leu Arg Asn Asp Phe Tyr Thr He 

115 120 125 

caa gag tta gee aag agg aag caa gcg cct gtg aga cct gaa caa ggc 
Gin Glu Leu Ala Lys Arg Lys Gin Ala Pro Val Arg Pro Glu Gin Gly 
130 ' 135 140 

ttg gcg aac a gc att ggc get ttc gag tat gtg gag tgt tea 
Gin lly Leu Ala Asn Ser He Gly Ala Phe Glu Tyr Val Glu Cys Ser 
145 150 155 160 

gcg aag ace aaa gat gga gtg agg agg gtc ttt gaa aag gee aca agg 
Ala Lys Thr Lys Asp Gly Val Arg Arg Val Phe Glu Lys Ala Thr Arg 

165 17 0 175 

get get ctg caa acg aat cga gtg aag aaa aag act ggt tgc ttt gtc 
Ala Ala Leu Gin Thr Asn Arg Val Lys Lys Lys Thr Gly Cys Phe Val 
180 185 190 

ttt tga 



192 



240 



288 



336 



384 



432 



480 



528 



576 



582 
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Phe 



<210> 262 

<211> 193 

<212> PRT 

<213> Mus musculus 

<400> 262 

Met Ala Ala lie Arg Lys Lys Leu Val lie Val Gly Asp Gly Ala Cys 

1 5 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 ~ 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 "60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val Leu 
65 70 75 80 

Leu Leu Cys Phe Ser He Gly Asn Pro Asp Ser Phe Gly Asn lie Pro 

85 90 95 

His Lys Trp lie Pro Glu Val Lys His Phe Cys Pro Asn Val Pro He 

100 105 110 

He Leu Val Gly Ser Lys Lys Asp Leu Arg Asn Asp Phe Tyr Thr He 

115 120 ' 125 

Gin Glu Leu Ala Lys Arg Lys Gin Ala Pro Val Arg Pro Glu Gin Gly 

130 135 140 

Gin Gly Leu Ala Asn Ser He Gly Ala Phe Glu Tyr Val Glu Cys Ser 
145 150 155 160 

Ala Lys Thr Lys Asp Gly Val Arg Arg Val Phe Glu Lys Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Thr Asn Arg Val Lys Lys Lys Thr Gly Cys Phe Val 
180 185 * 190 

Phe 



<210> 263 
<211> 576 
<212> DNA 
<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) . . (576) 



<400> 263 








atg 


ctg 


age 


tea 


ate 


aag 


Met 


Leu 


Ser 


Ser 


He 


Lys 


1 








5 




aaa 


acc 


tea 


ctg 


ttg gtg 


Lys 


Thr 


Ser 


Leu 


Leu 


Val 








20 






tac 


aaa 


ccc 


acg gtg 


tac 


Tyr 


Lys 


Pro 


Thr 


Val 


Tyr 






35 








ggc 


ate 


cag 


ate 


age 


ctg 


Gly 


He 


Gin 


He 


Ser 


Leu 




50 










ttc 


aga 


agt 


ate 


egg 


ccc 


Phe 


Arg 


Ser 


He Arg Pro 


65 










70 


atg 


tgc 


tac 


tct 


gtg gec 


Met 


Cys 


Tyr 


Ser 


Val 


Ala 








85 




aaa 


tgg 


att 


agt 


gag 


ate 


Lys 


Trp 


He 


Ser 


Glu 


lie 








100 







tgc gtg ctg gta ggg gac 
Cys Val Leu Val Gly Asp 
10 

cgc ttc acc tct gag acc 
Arg Phe Thr Ser Glu Thr 
25 

gag aat . acg ggt gta gac 
Glu Asn Thr Gly Val Asp 
40 

ggt etc tgg gac act gee 
Gly Leu Trp Asp Thr Ala 
55 60 
ctg tec tac cag cag gca 
Leu Ser Tyr Gin Gin Ala 
75 

aac cat aac teg ttc ctg 
Asn His Asn Ser Phe Leu 
90 

agg age aac eta ccc tgt 
Arg Ser Asn Leu Pro Cys 
105 



agt get gtg ggg 48 
Ser Ala Val Gly 
15 

ttc ccg gag gee 96 
Phe Pro Glu Ala 
30 

gtc ttc atg gat 144 

Val Phe Met Asp 

45 

ggc aac gac gec 192 
Gly Asn Asp Ala 

gac gtg gta ctg 240 
Asp Val Val Leu 
80 

aac ttg aag aac 288 
Asn Leu Lys Asn 
95 

acc ccg gtg ctg 336 
Thr Pro Val Leu 
110 



WO 2005/014828 



gtt 


gtg 


gec 


aca 


cag acg 


Val 


Val 


Ala 


Thr 


Gin Thx 






115 






tec 


tgc 


ate 


aat 


gec ata 


Ser 


Cys 


lie 


Asn 


Ala lie 




130 








gec 


aag 


ggc 


tac 


ctg gag 


Ala 


Lys 


Gly 


Tyr 


Leu Glu 


145 








150 


cag 


gta 


ttt 


gaa 


tgt get 


Gin 


Val 


Phe 


Glu 


Cys Ala 










165 


cga 


aac 


aga 


agg 


aag ctg 


Arg 


Asn 


Arg 


Arg 


Lys Leu 








180 




taa 











198/291 

gac cag aga gag gtg gga 
Asp Gin Arg Glu Val Gly 
120 

gaa ggg aag aga ctt gec 
Glu Gly Lys Arg Leu Ala 
135 140 
tgc tea gee etc age aac 
Cys Ser Ala Leu Ser Asn 
. 155 

gtc cga aca get gtc aac 
val Arg Thr Ala Val Asn 
170 

ttc tec ate aat gaa tgc 
Phe Ser lie Asn Glu Cys 
185 



PCT/EP2004/008136 



cct cac agg get 384 

Pro His Arg Ala 

125 

cag gat gtg aga 432 
Gin Asp Val Arg 

egg gga gta cag 480 
Arg Gly Val Gin 
160 

cag gee agg agg 528 
Gin Ala Arg Arg 
175 

aag ate ttc 573 
Lys lie Phe 
190 

576 



<210> 264 
<211> 191 
<212> PRT 
<213> Mus musculus 

<400> 264 

Met Leu Ser Ser He Lys Cys Val Leu Val Gly Asp Ser Ala Val Gly 

1-5 10 15 

Lys Thr Ser Leu Leu Val Arg Phe Thr Ser Glu Thr Phe Pro Glu Ala 

20 25 30 

Tyr Lys Pro Thr Val Tyr Glu Asn Thr Gly Val Asp Val Phe Met Asp 

35 40 45 

Gly He Gin He Ser Leu Gly Leu Trp Asp Thr- Ala Gly Asn Asp Ala 

50 55 60 

Phe Arg Ser He Arg Pro Leu Ser Tyr Gin Gin Ala Asp Val Val Leu 
65 ~ 70 75 80 

Met Cys Tyr Ser Val Ala Asn His Asn Ser Phe Leu Asn Leu Lys Asn 

85 90 95 

Lys Trp He Ser Glu He Arg Ser Asn Leu Pro Cys Thr Pro Val Leu 

100 105 HO 

Val Val Ala Thr Gin Thr Asp Gin Arg Glu Val Gly Pro His Arg Ala 

115 120 125 

Ser Cys He Asn Ala He Glu Gly Lys Arg Leu Ala Gin Asp Val Arg 

130 135 140 

Ala Lys Gly Tyr Leu Glu Cys Ser Ala Leu Ser Asn Arg Gly Val Gin 
145 " 150 155 160 

Gin Val Phe Glu Cys Ala Val Arg Thr Ala Val Asn Gin Ala Arg Arg 

165 170 175 

Arg Asn Arg Arg Lys Leu Phe Ser He Asn Glu Cys Lys He Phe 
180 185 190 

<210> 265 
<211> 786 
<212> DNA 
<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) - . (786) 

<400> 265 

atg gec ccg cag caa ggc egg ccg gcg ctg ccc gec cgc tgc gag ccg 48 
Met Ala Pro Gin Gin Gly Arg Pro Ala Leu Pro Ala Arg Cys Glu Pro 
15 10 15 

ccg gcg gcg ccg ccg gta ccg cct cgc cga gag cgc ggg ggg cgc ggg 96 
Pro Ala Ala Pro Pro Val Pro Pro Arg Arg Glu Arg Gly Gly Arg Gly 

20 25 30 

gcg cgc ggg ccc ggg gtg tec ggg ggt egg ggg cgc gcg ggc ggc gee 144 
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Ala 


Arg 


Gly 


Pro 


Gly 


Val 


Ser 


Gly Gly 


Arg 


Gly 


Arg 


Ala 


Gly Gly 


Ala 








35 










40 








45 










gag 


gga 


cgc 


ggc 


gtc 


aag 


tgc 


gtg ctg 


gtc 


ggc 


gac 


ggc 


gcg 


gtg 


ggc 


192 


Glu 


Gly 

50 


Arg 


Gly 


Val 


Lys 


Cys 
55 


val Leu 


Val 


Gly 


Asp 
60 


Gly 


Ala 


Val 


Gly 




aag 


acc 


age 


ctg 


gtg 


gtc 


age 


tac acc 


act 


aac 


ggc 


tac 


ccc 


acc 


gag 


240 


Lys 


Thr 


Ser 


Leu 


Val 


Val 


Ser 


Tyr Thr 


Thr 


Asn 


Gly 


Tyr 


Pro 


Thr 


Glu 




65 










70 








75 










80 




tac 


ate 


cct 


acg 


gee 


ttc 


gac 


aac ttc 


teg 


gee 


gtg 


gtg 


tct 


gta 


gat 


288 


Tyr 


lie 


Pro 


Thr 


Ala 


Phe 


Asp 


Asn Phe 


Ser 


Ala 


Val 


Val 


Ser 


val 


Asp 










85 








90 










95 






ggg 


egg 


cct 


gtg 


aga 


etc 


cag 


etc tgt 


gac 


act 


gca 


gga 


cag 


gat 


gag 


336 


Gly 


Arg 


Pro 


Val 


Arg 


Leu 


Gin 


Leu Cys 


Asp 


Thr 


Ala 


Gly 


Gin 


Asp 


Glu 








100 








105 










110 








ttt 


gac 


aag 


ctg 


agg 


ccc 


etc 


tgc tac 


acc 


aac 


aca 


gac 


ate 


ttc 


ctg 


384 


Phe 


Asp 


Lys 
115 


Leu 


Arg 


Pro 


Leu 


Cys Tyr 
120 


Thr 


Asn 


Thr 


Asp 
125 


He 


Phe 


Leu 




ctg 


tgc 


ttc 


age 


gtg 


gtg 


age 


ccc aca 


tec 


ttc 


cag 


aac 


gtg ggc 


gag 


432 


Leu 


Cys 
130 


Phe 


Ser 


Val 


Val 


Ser 
13 5 


Pro Thr 


Ser 


Phe 


Gin 
140 


Asn 


Val 


Gly 


Glu 




aag 


tgg 


gtt 


cca 


gag 


att 


cga 


cgt cac 


tgc 


cca 


aag 


gee 


ccc 


ate 


ate 


480 


Lys 


Trp 


Val 


Pro 


Glu 


He 


Arg 


Arg His 


Cys 


Pro 


Lys 


Ala 


Pro 


He 


He 




145 










150 








155 










160 




ctg 


gtc 


ggg 


aca 


cag 


teg 


gac 


etc agg 


gag 


gac gtc 


aaa 


gtg 


etc 


ata 


528 


Leu 


Val 


Gly 


Thr 


Gin 


Ser 


Asp 


Leu Arg 


Glu 


Asp Val 


Lys 


Val 


Leu 


He 










165 








170 










175 






gaa 


ctg 


gac 


aag 


tgc 


aaa 


gag 


aag ccg 


gtg 


cct 


gaa 


gag 


gcg 


gcg 


aag 


576 


Glu 


Leu 


Asp 


Lys 
180 


Cys 


Lys 


Glu 


Lys Pro 
185 


Val 


Pro 


Glu 


Glu 


Ala 
190 


Ala 


Lys 




ctg 


tgc 


gcg 


gag 


gaa 


gtc 


aaa 


get gtc 


tec 


tac 


ate' 


gag 


tgc 


tea 


gcg 


624 


Leu 


Cys 


Ala 


Glu 


Glu 


Val 


Lys 


Ala val 


Ser 


Tyr 


He 


Glu 


Cys 


Ser 


Ala 






195 










200 








205 










ttg 


act 


cag 


aaa 


aac 


etc 


aaa 


gag gtt 


ttc 


gac 


gee 


gee 


att 


gtt 


get 


672 


Leu 


Thr 


Gin 


Lys 


Asn 


Leu 


Lys 


Glu Val 


Phe 


Asp Ala 


Ala 


He 


Val 


Ala 






210 










215 








220 












ggt 


ate 


cag 


cac 


tea 


gac 


tec 


cag eta 


cag 


cca 


aag 


aag 


tct 


aaa 


age 


720 


Gly 


lie 


Gin 


His 


Ser 


Asp 


Ser 


Gin Leu 


Gin 


Pro 


Lys 


Lys 


Ser 


Lys 


Ser 




225 










23 0 








235 










240 




agg 


acc 


ccg 


gat 


aag 


gtg 


egg 


gac ctg 


tec 


aag 


tct 


tgg 


tgg 


agg 


aag 


768 


Arg 


Thr 


Pro 


Asp 


Lys 


Val 


Arg 


Asp Leu 


Ser 


Lys 


Ser 


Trp 


Trp 


Arg 


Lys 










245 








250 










255 






tat 


tgc 


tgc 


ctg 


gee 


tga 




















786 


Tyr 


Cys 


Cys 


Leu 
260 


Ala 

























<210> 266 
<211> 261 
<212> PRT 
<213> Mus mus cuius 



<400> 266 



Met 


Ala 


Pro 


Gin 


Gin 


Gly 


Arg 


Pro Ala 


Leu 


Pro Ala 


Arg 


Cys Glu 


Pro 


1 








5 








10 






15 




Pro 


Ala 


Ala 


Pro 


Pro 


Val 


Pro 


Pro Arg 


Arg 


Glu Arg 


Gly 


Gly Arg 


Gly 








20 








25 








30 




Ala 


Arg 


Gly 


Pro 


Gly 


Val 


Ser 


Gly Gly 


Arg 


Gly Arg 


Ala 


Gly Gly 


Ala 






35 










40 






45 






Glu 


Gly 


Arg 


Gly 


Val 


Lys 


Cys 


Val Leu 


val 


Gly Asp 


Gly 


Ala Val 


Gly 




50 










55 






60 








Lys 


Thr 


Ser 


lieu 


Val 


Val 


Ser 


Tyr Thr 


Thr 


Asn Gly 


Tyr 


Pro Thr 


Glu 


65 










70 








75 






80 


Tyr 


He 


Pro 


Thr 


Ala 


Phe 


Asp 


Asn Phe 


Ser 


Ala Val 


Val 


Ser Val 


Asp 








85 








90 






95 




Gly 


Arg 


Pro 


Val 


Arg 


Leu 


Gin 


Leu Cys 


Asp 


Thr Ala 


Gly 


Gin Asp 


Glu 




100 








105 








110 




Phe 


Asp 


Lys 


Leu 


Arg 


Pro 


Leu 


Cys Tyr 


Thr 


Asn Thr 


Asp 


He Phe 


Leu 



115 120 125 



Leu 


Cys 
130 


Phe 


Ser 


Val 


Val 


Lys 


Trp 


Val 


Pro 


Glu 


He 


145 








150 


Leu 


Val Gly 


Thr 


Gin 


Ser 










165 




Glu 


Leu Asp 


Lys 


Cys 


Lys 








180 






Leu 


Cys 


Ala 


Glu 


Glu 


Val 




195 








Leu 


Thr 
210 


Gin 


Lys 


Asn 


Leu 


Gly 


He 


Gin 


His 


Ser 


Asp 


225 










230 


Arg 


Thr 


Pro 


Asp 


Lys 


Val 








245 




Tyr 


Cys 


Cys 


Leu 


Ala 










260 
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Ser Pro Thr Ser Phe Gin Asn Val Gly Glu 
135 140 
Arg Arg His Cys Pro Lys Ala Pro He He 
155 160 
Asp Leu Arg Glu Asp Val Lys Val Leu He 

170 175 
Glu Lys Pro Val Pro -Glu Glu Ala Ala Lys 

185 190 
Lys Ala Val Ser. Tyr He Glu Cys Ser Ala 

200 205 
Lys Glu Val Phe Asp Ala Ala He Val Ala 
215 220 
Ser Gin Leu Gin Pro Lys Lys Ser Lys Ser 
235 240 
Arg Asp Leu Ser Lys Ser Trp Trp Arg Lys 
250 255 



<210> 267 
<211> 624 
<212> DNA 

<213> Trichoderma reesei 

<220> 

<221> CDS 

<222> (1)..(624) 

<400> 267 

atg cct etc tgc ggc ggc tec aag acg gtg cag cgc aag ctg gtt ctt 48 
Met Pro Leu Cys Gly Gly Ser Lys Thr Val Gin Arg Lys Leu Val Leu 
15 10 15 

ctg ggc gat ggt gec age gga aag acg teg ctg etc aac gtc ttc aca 96 
Leu Gly Asp Gly Ala Ser Gly Lys Thr Ser Leu Leu Asn Val Phe Thr 

20 25 30 

aga ggt tac ttt ccc ace gtc tac gaa cct acc gtc ttt gaa aat tac 144 
Arg Gly Tyr Phe Pro Thr Val Tyr Glu Pro Thr Val Phe Glu Asn Tyr 

35 40 45 

gtc cac gac ate ttt gtc gac aac gtc cac ate gag etc tec etc tgg 192 
Val His Asp He Phe Val Asp Asn Val His He Glu Leu Ser Leu Trp 

50 55 60 

gat acg gcg gga cag gag gaa ttc gat egg ctg cga teg etc tec tac 240 
Asp Thr Ala Gly Gin Glu Glu Phe Asp Arg Leu Arg Ser Leu Ser Tyr 
65 70 75 80 

gat gac acc gat ttg ate gtg etc tgt tac teg gtc gat age aaa gac 288 
Asp Asp Thr Asp Leu He Val Leu Cys Tyr Ser Val Asp Ser Lys Asp 

85 90 95 

teg eta gaa aac gtc gaa tec aaa tgg gtc gga gag att gee gac aac 336 
Ser Leu Glu Asn Val Glu Ser Lys Trp Val Gly Glu He Ala Asp Asn 

100 105 HO 

tgc ccc ggc gtc aag ctg gtc etc gtc gee etc aag tgc gac ctg cgc 384 
Cys Pro Gly Val Lys Leu Val Leu Val Ala Leu Lys Cys Asp Leu Arg 

115 120' 125 

cag caa gag gac gac gag ccc gag gac cag gca gcg gee gac ggc aac 432 
Gin Gin Glu Asp Asp Glu Pro Glu Asp Gin Ala Ala Ala Asp Gly Asn 

130 A ~ 135 140 

gca cag cgc gag aaa ccg ccc acg att tec tac gac gag ggc ctt gag 
Ala Gin Arg Glu Lys Pro Pro Thr He Ser Tyr Asp Glu Gly Leu Glu 
145 ~ 150 155 160 

gtc gee aag egg ata ggc gee teg cgc tac ctg gag tgc teg gcg atg 
Val Ala Lys Arg He Gly Ala Ser Arg Tyr Leu Glu Cys Ser Ala Met 

165 170 175 

aag aac cgc ggc gtc aac gag gee ttt acc gag gcg gee cgc gta gcg 576 
Lys Asn Arg Gly Val Asn Glu Ala Phe Thr Glu Ala Ala Arg Val Ala 

180 185 190 

eta age gtc aag aag gag agg gaa gac aac aag tgc aca ate atg 621 
Leu Ser Val Lys Lys Glu Arg Glu Asp Asn Lys Cys Thr He Met 
195 200 205 



480 



528 



PCI7EP2004/008136 
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624 



<210> 268 
<211> 207 
<212> PRT 

<213> Trichoderma reesei 
<400> 268 



Met 


Pro 


Leu 


Cys Gly Gly Ser 


Lys 


Thr 


Val Gin 


Arg Lys Leu Val 


Leu 


1 






5 






10 


15 




Leu 


Gly 


Asp 


Gly Ala Ser Gly 


Lys 


Thr 


Ser Leu 


Leu Asn Val Phe 


Thr 








20 




25 




30 




Arg 


Gly 


Tyr 


Phe Pro Thr Val 


Tyr 


Glu 


Pro Thr 


Val Phe Glu Asn 


Tyr 






35 




40 






45 




"Val 


His 


Asp 


lie Phe Val Asp 


Asn 


Val 


His He 


Glu Leu Ser Leu 


Trp 




50 


55 








60 




Asp 


Thr 


Ala 


Gly Gin Glu Glu 


Phe 


Asp 


Arg Leu 


Arg Ser Leu Ser 


Tyr 


65 






70 






75 




80 


Asp 


Asp 


Thr 


Asp Leu lie Val 


Leu 


Cys 


Tyr Ser 


Val Asp Ser Lys 


Asp 








85 






90 


95 




Ser 


Leu 


Glu 


Asn Val Glu Ser 


Lys 


Trp 


Val Gly 


Glu lie Ala Asp 


Asn 








100 




105 




110 




Cys 


Pro 


Gly 


Val Lys Leu Val 


Leu 


Val 


Ala Leu 


Lys Cys Asp Leu 


Arg 






115 




120 






125 




Gin 


Gin 


Glu 


Asp Asp Glu Pro 


Glu 


Asp 


Gin Ala 


Ala Ala Asp Gly 


Asn 




130 




135 








140 




Ala 


Gin 


Arg 


Glu Lys Pro Pro 


Thr 


He 


Ser Tyr 


Asp Glu Gly Leu 


Glu 


145 






150 






155 




160 


Val 


Ala 


Lys 


Arg lie Gly Ala 


Ser 


Arg 


Tyr Leu 


Glu Cys Ser Ala 


Met 








165 






170. 


175 




Lys 


Asn 


Arg 


Gly Val Asn Glu 


Ala 


Phe 


Thr Glu 


Ala Ala Arg Val 


Ala 






180 




.185 




190 




Leu 


Ser 


Val 


Lys Lys Glu Arg 


Glu 


Asp 


Asn Lys 


Cys Thr He Met 








195 




200 






205 





<210> 269 
<211> 675 
<212> DNA 

<213> Ashbya gossypii 

<220> 

<221> CDS 

<222> (1)..(675) 

<4G0> 269 



atg 


cct 


ctg 


tgt 


ggg 


teg 


age 


teg 


teg 


teg aag 


cat 


cct 


ate 


gag 


cgc 


48 


Met 


Pro 


Leu 


Cys 


Gly 


Ser 


Ser 


Ser 


Ser 


Ser Lys 


His 


Pro 


He 


Glu 


Arg 




1 








5 










10 








15 






aag 


ate 


gtc 


ate 


etc 


gga 


gac 


ggt 


get 


tgc ggg 


aag 


acg 


teg 


ctg 


ttg 


96 


Lys 


He 


Val 


He 


Leu 


Gly Asp 


Gly 


Ala 


Cys Gly 


Lys 


Thr 


Ser 


Leu 


Leu 










20 










25 








30 








aac 


gtg 


ttc 


acg 


cga 


ggg 


tac 


ttt 


ccg 


aag gtg 


tac 


gag 


ccc 


acg 


gta 


144 


Asn 


Val 


Phe 


Thr 


Arg 


Gly 


Tyr 


Phe 


Pro 


Lys Val 


Tyr 


Glu 


Pro 


Thr 


Val 








35 










40 








45 










ttc 


gaa 


aac 


tac 


ate 


cat 


gac 


ate 


ttc 


gtg gac 


aac 


cag 


cac 


ate 


acg 


192 


Phe 


Glu 


Asn 


Tyr 


He 


His 


Asp 


He 


Phe 


Val Asp 


Asn 


Gin His 


He 


Thr 






50 










55 








60 












ctg 


age 


ctg 


tgg 


gac 


act 


get 


ggg 


cag 


gag gag 


ttt 


gac 


egg 


ttg 


cga 


240 


Leu 


Ser 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu Glu 


Phe Asp Arg Leu 


Arg 




65 










70 








75 










80 




teg 


ctg 


teg 


tac 


teg 


gac 


aca 


cac 


acg 


att atg 


ctg 


tgt 


ttc 


teg 


gtg 


288 


Ser 


Leu 


Ser 


Tyr 


Ser 


Asp 


Thr 


His 


Thr 


He Met 


Leu 


Cys 


Phe 


Ser 


Val 












85 










90 








95 






gac 


teg 


egg 


gac 


teg 


ctg 


gag 


aac 


gtc 


aag aac 


aag 


tgg 


gtg 


age 


gaa . 


336 


Asp 


Ser 


Arg 


Asp 


Ser 


Leu 


Glu 


Asn 


Val 


Lys Asn 


Lys 


Trp Val 


Ser 


Glu 
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100 105 110 

att gcg gac cac tgc gag ggc gtg aag ctg gtg eta gtg gcg ctg aag 384 
lie Ala Asp His Cys Glu Gly Val Lys Leu Val Leu Val Ala Leu Lys 

115 "* 120 125 

tgc gac ttg cgc age age gac gag tac ggc aac gag age gee ate acg 432 
Cys Asp Leu Arg Ser Ser Asp Glu Tyr Gly Asn Glu Ser Ala He Thr 

130 ^ 135 .140 

ccg ggg tec ate cag aac cag aag tac aac ggc ggc ggc ggc aac ggg 480 
Pro Gly Ser He Gin Asn Gin Lys Tyr Asn. Gly Gly Gly Gly Asn Gly 
145 150 155 160 

ctg ate ccc tac gac gag ggg ctg gcg atg gee aag cag att ggg gcg 528 
Leu He Pro Tyr Asp Glu Gly Leu Ala Met Ala Lys Gin He Gly Ala 

165 170 175 

ctg cgc tat ctg gag tgc age gee aag atg aac cgt ggc gtg aac gag 576 
Leu Arg Tyr Leu Glu Cys Ser Ala Lys Met Asn Arg Gly Val Asn Glu 

180 185 190 

gcg ttc ace gag get gcg cgc tgc gcg ctg act gcg aca ccg aag ggg ..._ 624 
Ala Phe Thr "Glu Ala Ala Arg Cys "Ala Leu Thr Ala Thr Pro Lys Gly 

195 200 205 

gee egg gac tct gcg ccc gag gee gaa age age agt tgt act ate atg 
Ala Arg Asp Ser Ala Pro Glu Ala Glu Ser Ser Ser Cys Thr He Met 
210 215 220 



672 



tga 



675 



<210> 270 
<211> 224 
<212> PRT 

<213> Ashbya gossypii 
<400> 270 

Met Pro Leu Cys Gly Ser Ser Ser Ser Ser Lys His Pro He Glu Arg 

.! 5 10 15 

Lys He Val He Leu Gly Asp Gly Ala Cys Gly Lys Thr Ser Leu Leu 

20 25 30 

Asn Val Phe Thr Arg Gly Tyr Phe Pro Lys Val Tyr Glu Pro Thr Val 

35 40 45 

Phe Glu Asn Tyr He His Asp He Phe Val Asp Asn Gin His He Thr 

50 55 60 

Leu Ser Leu Trp Asp Thr Ala Gly Gin Glu Glu Phe Asp Arg Leu Arg 
65 .70 75 80 

Ser Leu Ser Tyr Ser Asp Thr His Thr He Met Leu Cys Phe Ser Val 

85 90 95 

Asp Ser Arg Asp Ser Leu Glu Asn Val Lys Asn Lys Trp Val Ser Glu 

100 105 HO 

He Ala Asp His Cys Glu Gly Val Lys Leu Val Leu Val Ala Leu Lys 

115 120 125 

Cys Asp Leu Arg Ser Ser Asp Glu Tyr Gly Asn Glu Ser Ala He Thr 

130 " 135 140 

Pro Gly Ser He Gin Asn Gin Lys Tyr Asn Gly Gly Gly Gly Asn Gly 
145 150 155 160 

Leu He Pro Tyr Asp Glu Gly Leu Ala Met Ala Lys Gin He Gly Ala 

165 170 175 

Leu Arg Tyr Leu Glu Cys Ser Ala Lys Met Asn Arg Gly Val Asn Glu 

180 185 190 

Ala Phe Thr Glu Ala Ala Arg Cys Ala Leu Thr Ala Thr Pro Lys Gly 

195 200 205 

Ala Arg Asp Ser Ala Pro Glu Ala Glu Ser Ser Ser Cys Thr He Met 
210 213 220 

<210> 271 
<211> 624 
<212> DNA 

<213> Ashbya gossypii 



<220> 
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<221> CDS 

<222> (1) . . (624) 

<400> 271 

atg tct cag caa atg cat aac ccc agt ate agg aga aaa ttg gtg ate 48 

Met Ser Gin Gin Met His Asn Pro. Ser lie Arg Arg Lys Leu Val He 

1 5 10 15 

gtc gga gat ggt gca tgc ggg aaa aca tgt ctt ttg att gtg ttt gee 96 

Val Gly Asp Gly Ala Cys Gly Lys Thr Cys. Leu Leu He Val Phe Ala 

20 25 30 

aag gga aag ttc cca cag gtg tat gtt cct acg gtt ttc gac aac tac 144 
Lys Gly Lys Phe Pro Gin Val Tyr Val Pro Thr Val Phe Asp Asn Tyr 

35 40 45 

gtt gca gat gtg gag gta gac ggc aga egg gtg gag ctt gcg ctt tgg 192 
Val Ala Asp Val Glu Val Asp Gly Arg Arg Val Glu Leu Ala Leu Trp 

50 55 60 

gat acg get ggg cag gag gat tac gac agg cta_ egg ccg tta teg tac 240 
Asp Thr" Ala Gly Gin Glu Asp Tyr Asp Arg 'L'eu Arg Pro Leu Ser Tyr 
65 70 75 80 

cca gac tec aat gtt gtg ttg ate tgc tac teg att gac eta cca gac 288 
Pro Asp Ser Asn Val Val Leu He Cys Tyr Ser lie Asp Leu Pro Asp 

85 90 95 

teg ttg gag aac gtg atg gag aag tgg ate age gag gtg eta tac ttc 336 
Ser Leu Glu Asn Val Met Glu Lys Trp He Ser Glu Val Leu Tyr Phe 

100 105 110 

tgc cag ggt gtt ccg ate ate ttg gtg ggg tgc aag get gac ttg egg 384 
Cys Gin Gly Val Pro He He Leu Val Gly Cys Lys Ala Asp Leu Arg 

115 120 125 

aac gat ccg caa gtg ate gag cag ttg aga cag cag gga cag cag cct 432 
Asn Asp Pro Gin Val He Glu Gin Leu Arg Gin Gin Gly Gin Gin Pro 

130 135 140 

gtc teg cag get cag gcg cag gag gta gcg gac cag ate ggc gcg gta 480 
Val Ser Gin Ala Gin Ala Gin Glu Val Ala Asp Gin He Gly Ala Val 
145 150 155 160 

gag tac att gag tgc tct gca aag ace ggc ttt ggt gtg cgc gag gtg 528 
Glu Tyr lie Glu Cys Ser Ala Lys Thr Gly Phe Gly Val Arg Glu Val 

165 170 175 

ttt gag gcg gee acg cgt get tec ttg atg ggg aaa caa ggc aag tct 576 
Phe' Glu Ala Ala Thr Arg Ala Ser Leu Met Gly Lys Gin Gly Lys Ser 

180 185 ~* 190 

aag gcg aag tct gac aag aag aag aag aaa aag tgt gtg gtc ttg 621 
Lys Ala Lys Ser Asp Lys Lys Lys Lys Lys Lys Cys Val Val Leu 

195 200 205 

tag 624 



<210> 272 
<211> 207 
<212> PRT 

<213> Ashbya gossypii 
<400> 272 

Met Ser Gin Gin Met His Asn Pro Ser He Arg Arg Lys Leu Val He 

1 5 10 15 

Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe Ala 

20 25 30 

Lys Gly Lys Phe Pro Gin Val Tyr Val Pro Thr Val Phe Asp Asn Tyr 

35 40 45 

Val Ala Asp Val Glu Val Asp Gly Arg Arg Val Glu Leu Ala Leu Trp 

50 55 60 

Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr 
65 70 75 80 

Pro Asp Ser Asn Val Val Leu He Cys Tyr Ser He Asp Leu Pro Asp 

85 90 95 

Ser Leu Glu Asn Val Met Glu Lys Trp He Ser Glu Val Leu Tyr Phe 
100 105 110 
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Cys Gin Gly Val Pro lie lie Leu Val Gly Cys Lys Ala Asp Leu Arg 

115 120 125 

Asn Asp Pro Gin Val lie Glu Gin Leu Arg Gin Gin Gly Gin Gin Pro 

130 135 140 

Val Ser Gin Ala Gin Ala Gin Glu Val Ala Asp Gin lie Gly Ala Val 
145 150 155 160 

Glu Tyr lie Glu Cys Ser Ala Lys Thr Gly Phe Gly Val Arg Glu Val 

165 170 175 

Phe Glu Ala Ala Thr Arg Ala Ser Leu Met. Gly Lys Gin Gly Lys Ser 

180 185 190 

Lys Ala Lys Ser Asp Lys Lys Lys Lys Lys Lys Cys Val Val Leu 
195 200 205 

<210> 273 
<211> 615 
<212> DNA 

<213> Yarrowia lipolytics. _ . 

<220> 

<221> CDS 

<222> (1) . . (615) 

<400> 273 

atg ccc caa cca acg gat ctg cga aga aaa etc gtc att gtc ggc gac 48 
Met Pro Gin Pro Thr Asp Leu Arg Arg Lys Leu Val He Val Gly Asp 
15 10 15 

gga gec tgc ggc aag acc tgt ctg ctg att gtc ttc gcg aag ggc acc 96 
Gly Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe Ala Lys Gly Thr 

20 25 . 30 

ttc ccg gag gtg tac gtg ccc acc gtg ttt gag aac tac gtg gec gac 144 
Phe Pro Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp 

35 " 40 45 

gtg gag att gac ggc cga cga gtg gag ctg gec ctg tgg gat acc gec 192 
Val Glu He Asp Gly Arg Arg Val Glu Leu Ala Leu Trp Asp Thr Ala 

50 *" 55 60 

ggc cag gag gat tac gac cga ctg cga ccc ctg tec tac ccc gac gec 240 
Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ala 
65 70 75 80 

aac gtc ate ate ate tgc ttc gee ate gac age ccc gac teg etc gac 288 
Asn Val He He He Cys Phe Ala He Asp Ser Pro Asp Ser Leu Asp 

85 90 95 

aac gtg cag gag aag tgg ate tec gag gtg ctg cac ttt tgc cag ggc 336 
Asn Val Gin Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly 

100 105 HO 

gtg cct att ctg ctg gtc ggt tgc aag gtc gac ctg cga aac gac ccc 384 
Val Pro He Leu Leu Val Gly Cys Lys Val Asp Leu Arg Asn Asp Pro 

115 120 125 

aag acc ate gag gag ctg aga aga acg tec cag cga ccc gtc acg acc 432 
Lys Thr He Glu Glu Leu Arg Arg Thr Ser Gin Arg Pro Val Thr Thr 

130 135 140 

gag gag gga aat gee gtg gee cag aag att ggc gec ggc aag tac ctg 480 
Glu Glu Gly Asn Ala Val Ala Gin Lys He Gly Ala Gly Lys Tyr Leu 
145 150 155 160 

gag tgc tct gec cga acc cac gac gga gtg aga gag gtc ttc gag cat 528 
Glu Cys Ser Ala Arg Thr His Asp Gly Val Arg Glu Val Phe Glu His 

165 170 175 

gee acc cga get gec ctg act gee cac ggc cag aag ggc tec aag age 576 
Ala Thr Arg Ala Ala Leu Thr Ala His Gly Gin Lys Gly Ser Lys Ser 

180 185 190 

tec age aga gag ggc aag aaa aag tgt ttg att ttg taa 615 
Ser Ser Arg Glu Gly Lys Lys Lys Cys Leu He Leu 
195 200 



<210> 274 
<211> 204 
<212> PRT 
<213> Yarrowia 



lipolytica 



WO 2005/014828 



205/291 
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<400> 274 




















Met 


Pro 


Gin 


Pro Thr Asp 


Leu Arg 


Arg 


Lys 


Leu Val 


lie 


Val 


Gly 


Asp 


1 






' 5 






10 








15 




Gly 


Ala 


Cys 


Gly Lys Thr 


Cys Leu 


Leu 


He 


Val Phe 


Ala 


Lys 


Gly 


Thr 








20 




25 








30 






Phe 


Pro 


Glu 


Val Tyr Val 


Pro Thr 


Val 


Phe 


Glu Asn 


Tyr 


Val 


Ala 


Asp 






35 




40 








45 








Val 


Glu 


He 


Asp Gly Arg 


Arg Val 


Glu 


Leu. 


Ala Leu 


Trp 


Asp 


Thr 


Ala 




50 






55 






60 










Gly 


Gin 


Glu 


Asp Tyr Asp 


Arg Leu 


Arg 


Pro 


Leu Ser 


Tyr 


Pro 


Asp 


Ala 


65 






70 








75 








80 


Asn 


Val 


He 


He lie Cys 


Phe Ala 


lie 


Asp 


Ser Pro 


Asp 


Ser 


Leu 


Asp 








85 






90 








95 




Asn 


Val 


Gin 


Glu Lys Trp 


He Ser 


Glu 


Val 


Leu His 


Phe 


Cys 


Gin 


Gly 








100 




105 








110 






Val 


Pro 


He 


Leu Leu Val 


Gly Cys 


Lys 


Val 


Asp Leu 


Arg 


Asn 


Asp 


Pro 






115 




120 








125 






Lys 


Thr 


He 


Glu Glu Leu 


Arg Arg 


Thr 


Ser 


Gin Arg 


Pro 


Val 


Thr 


Thr 




130 






135 






140 










Glu 


Glu 


Gly 


Asn Ala Val 


Ala Gin 


Lys 


lie 


Gly Ala 


Gly 


Lys 


Tyr 


Leu 


145 






150 








155 








160 


Glu 


Cys 


Ser 


Ala Arg Tnr 


His Asp 


Gly 


Val 


Arg Glu 


Val 


Phe 


Glu 


His 








165 






170 








175 




Ala 


Thr 


Arg 


Ala Ala Leu 


Thr Ala 


His 


Gly 


Gin Lys 


Gly 


Ser 


Lys 


Ser 








180 




185 








190 






Ser 


Ser 


Arg 


Glu Gly Lys 


Lys Lys 


Cys 


Leu 


He Leu 














195 




200 

















<210> 275 
<211> 915 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (10) . . (657) 

<400> 275 

gtcttggcc atg gcc tec age gec tec egg ttc ate aag tgc gtc aca gtg 51 
Met Ala Ser Ser Ala Ser Arg Phe He Lys Cys Val Thr Val 
1 5 10 

ggg gac ggc gcc gtg gga aag ace tgc atg etc ate tgc tac ace age 99 
Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser 
15 20 25 30 ' 

aac aag ttc ccc act gat. tac ata ccc act. gtc ttc gat aac ttc agt 147 
Asn Lys Phe Pro Thr Asp Tyr lie Pro Thr . Val Phe Asp Asn Phe Ser 

35 40 45 

gca aac gtg gta gtc gac ggt acc acg gtg aat ttg ggc ctt tgg gat 195 
Ala Asn Val Val Val Asp Gly Thr Thr Val Asn Leu Gly Leu Trp Asp 

50 55 60 

act gca ggg cag gaa gat tac aac aga ctg agg cca ctg age tac cgc 243 
Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg 

65 70 75 

gga gca gat gtg ttc gtg etc gcc ttc teg ctt gtc age cga get age 291 
Gly Ala Asp Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser 

80 ' gs 90 

tac gag aat gtt atg aag aag tgg eta ccg gag ctt cag cat tat gca 339 
Tyr Glu Asn Val Met Lys Lys Trp Leu Pro Glu Leu Gin His Tyr Ala 
95 100 105 110 

ccg ggt gtc ccc ata gtg ttg gcc gga act aaa ttg gat ctt cgt gaa 387 
Pro Gly Val Pro lie Val Leu Ala Gly Thr Lys Leu Asp Leu Arg Glu 

115 120 125 

gac agg cac tac tt.a gtt gac cat cct ggt gcg gta cct gtt act aca 435 
Asp Arg His Tyr Leu Val Asp His Pro Gly Ala Val Pro Val Thr Thr 

130 135 140 

gca cag ggg gag gaa ctg cgc aag cac att ggc gca act tgc tac ate 483 
Ala Gin Gly Glu Glu Leu Arg Lys His He Gly Ala Thr Cys Tyr He 
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145 150 155 

gaa tgc age tea aaa act cag cag aat gtg aaa get gtg ttt gat get 531 
Glu Cys Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala 

160 165 170 

gee ate aag gta gta ate agg cct ccg acg aag cag cga gaa agg aag 579 
Ala He Lys Val Val He Arg Pro Pro Thr Lys Gin Arg Glu Arg Lys 
175 180 185 190 

aaa aag aaa gaa egg cga gga tgc tea ata ttc tgc age cgt ate atg 627 
Lys Lys Lys Glu Arg Arg Gly Cys Ser He Phe Cys Ser Arg lie Met 

195 200* 205 

cac gca aga aga eta gga tgc ttc aag tgatagaagg ccttcttcga 674 
His Ala Arg Arg Leu Gly Cys Phe Lys 
210 215 
egataaegta ggcttctaag atgaatcgag atcgtgtata gtttgtctcc atgttccaaa 734 



gtgctcgttg cgcttgtgca cttgggctgg taatgtgtat atttagtctt tgccaattga 794 

tatttaggag aaatattaag cacttcgttt atgtaccatt ggctcataat gagctgggaa 854 

gtccaataca tactaagaag taacaatatt ttcttccgat gattttatta tggtttctcg 914 

t 915 



<210> 276 
<211> 215 
<212> PRT 
<213> Zea mays 



<400> 276 

Met Ala Ser Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp 

15 10 15 

Gly Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Ser Asn Lys 

20 25 30 

Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 

35 40 45 

Val Val Val Asp Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 

50 55 60 

Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 ■ 70 75 80 

Asp Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser Tyr Glu 

85 90 95 

Asn Val Met Lys Lys Trp Leu Pro Glu Leu Gin His Tyr Ala Pro Gly 

100 ** 105 110 

Val Pro He Val Leu Ala Gly Thr Lys Leu Asp Leu Arg Glu Asp Arg 

115 120 125 

His Tyr Leu Val Asp His Pro Gly Ala Val Pro Val Thr Thr Ala Gin 

130 " 135 140 

Gly Glu Glu Leu Arg Lys His He Gly Ala Thr Cys Tyr He Glu Cys 
145 150 155 160 

Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He 

165 170 175 

Lys Val Val He Arg Pro Pro Thr Lys Gin Arg Glu Arg Lys Lys Lys 

180 ~ 185 190 

Lys Glu Arg Arg Gly Cys Ser He Phe Cys Ser Arg He Met His Ala 

195 200 205 

Arg Arg Leu Gly Cys Phe Lys 
210 215 



<210> 277 
<211> 1002 
<212> DNA 
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<213> Zea mays 

<220> 

<22I> CDS 

<222> (149).. (787) 

<400> 277 

agaagataag caaagcaagg caaccgttgt cgttgtctct gttcctccgc ttcctgctct 60 
cttggcttgc tgcttgccct cggagcagtg ccttccgccg ccgccgccgc cgcctgttgt 120 



gagaaggaga ggccggggct gggaggag atg age gtg acc aag ttc ate aag 172 

Met Ser Val Thr Lys Phe He Lys 
1 5 



tgc 


gtc 


acg 


gtg 


ggg 


gac 


ggc 


gcg gtg ggc 


aag 


acc 


tgc 


atg -etc 


ate 


220 


Cys 


val 


Thr 


Val 


Gly Asp 


Gly 


Ala Val Gly "Lys 


Thr 


Cys 


Met Leu 


He " 






10 










15 






20 










tgc 


tac 


acc 


age 


aac 


aag 


ttc 


ccc acg gat 


tac 


ate 


ccc 


acg gtg 


ttc 


268 


Cys 


Tyr 


Thr 


Ser 


Asn 


Lys 


Phe 


Pro Thr Asp 


Tyr 


He 


Pro 


Thr Val 


Phe 




25 










30 






35 








40 




gac 


aac 


ttc 


age 


gee 


aac 


gtc 


tec gtg gac 


ggc 


age 


ate 


gtc aac 


ctg 


316 


Asp 


Asn 


Phe 


Ser 


Ala 
45 


Asn 


Val 


Ser Val Asp 
50 


Gly 


Ser 


He 


Val Asn 
55 


Leu 




ggc 


etc 


tgg 


gac 


act 


gca 


gga 


caa gag gac 


tac 


age 


aga 


ttg egg 


cca 


364 


Gly 


Leu 


Trp 


Asp 
60 


Thr 


Ala 


Gly 


Gin Glu Asp 
65 


Tyr 


Ser 


Arg 


Leu Arg 
70 


Pro 




ctg 


age 


tac 


agg 


ggc 


gcg 


gac 


gtg ttc gtg 


ctg 


gee 


ttc 


tec ttg 


ate 


412 


Leu 


Ser 


Tyr 


Arg 


Gly Ala 


Asp 


Val Phe Val 


Leu 


Ala 


Phe 


Ser Leu 


He 








75 










80 






85 








age 


agg 


gcg 


age 


tat 


gag 


aac 


gtc ctt aag 


aag 


tgg 


gtg 


cca gag 


ctt 


'460 


Ser 


Arg 


Ala 


Ser 


Tyr Glu 


Asn 


Val Leu Lys 


Lys 


Trp 


Val 


Pro Glu 


Leu 






90 










95 






100 










cgc 


aga 


ttc 


gcg 


ccc 


gac 


gtc 


ccg gtc gtt 


ctt 


gtc 


ggg 


acc aag 


tta 


508 


Arg 


Arg 


Phe 


Ala 


Pro Asp 


Val 


Pro Val Val 


Leu 


Val 


Gly 


Thr Lys 


Leu 




105 










110 






115 




120 




gat 


etc 


cgt 


gac 


cac 


agg 


gee 


tac ctt get 


gac 


cat 


cct 


gga gcg 


teg 


556 


Asp 


Leu 


Arg 


Asp 


His 
125 


Arg 


Ala 


Tyr Leu Ala 
130 


Asp 


His 


Pro 


Gly Ala 
135 


Ser 




acg 


ate 


acg 


acg 


gca 


cag 


ggc 


gaa gaa ctg 


agg 


agg 


cag 


ate ggc 


get 


604 


Thr 


He 


Thr 


Thr 
140 


Ala 


Gin 


Gly 


Glu Glu Leu 
145 


Arg 


Arg 


Gin 


He Gly 
150 


Ala 




gcg 


get 


tac 


ate 


gag 


tgc 


agt 


tec aaa acg 


cag 


cag 


aat 


gtc aag 


teg 


652 


Ala 


Ala 


Tyr 


lie 


Glu 


Cys 


Ser 


Ser Lys Thr 


Gin 


Gin 


Asn 


Val Lys 


Ser 








155 










160 






165 






gtc 


ttc 


gac 


aea 


gee 


ate 


aaa 


gtg gtc ctt 


cag 


ccc 


ccg 


egg agg 


agg 


700 


val 


Phe 


Asp 


Thr 


Ala 


He 


Lys 


Val Val Leu 


Gin 


Pro 


Pro 


Arg Arg 


Arg 






170 










175 






180 






gag 


gcg 


acg 


cct 


gee 


agg 


agg 


aag aac agg 


cgt 


ggc 


tec 


ggg tgc 


tct 


748 


Glu 


Ala 


Thr 


Pro 


Ala Arg 


Arg 


Lys Asn Arg 


Arg 


Gly 


Ser 


Gly Cys 


Ser 




185 










190 






195 








200 




ate 


atg 


aac 


etc 


atg 


tgt 


ggc 


age acg tgc 


get 


get 


taggagtcta 




794 


He 


Met 


Asn 


Leu 


Met 
205 


Cys 


Gly 


Ser Thr Cys 
210 


Ala 


Ala 











gaacactgat ctggaaggag gtgaaggtga aggcatggtg tctatgtgct atggcgactg 854 



gcaagttaat ggggcegcat ggatgactgc tgctcttgtt tttttaagct cgtctgccgt 914 
atgctttgtt tttttaggct tcaaggactg acaattgeaa gaatgcagtg tttatgtaag 974 



aggttgtttg ctggaatagg attgetgt 



1002 
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<210> 278 
<211> 212 
<212> PRT 
<213> Zea mays 

<400> 278 

Met Ser Val Thr Lys Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

15 10 15 

Val Gly Lys Thr Cys Met Leu lie Cys Tyr' Thr Ser Asn Lys Phe Pro 

20 25 30 

Thr Asp Tyr lie Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Ser 

35 40 45 

Val Asp Gly Ser lie Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe Val Leu- Ala Phe Ser Leu lie Ser Arg Ala Ser Tyr Glu Asn Val 

85 90 95 

Leu Lys Lys Trp Val Pro Glu Leu Arg Arg Phe Ala Pro Asp Val Pro 

100 105 110 

Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp His Arg Ala Tyr 

115 120 125 

Leu Ala Asp His Pro Gly Ala Ser Thr lie Thr Thr Ala Gin Gly Glu 

130 " 135 140 

Glu Leu Arg Arg Gin lie Gly Ala Ala Ala Tyr lie Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ser Val Phe Asp Thr Ala lie Lys Val 

165 170 175 

Val Leu Gin Pro Pro Arg 'Arg Arg Glu Ala Thr Pro Ala Arg Arg Lys 

180 185 190 

Asn Arg Arg Gly Ser Gly Cys Ser lie Met Asn Leu Met Cys Gly Ser 

195 200 205 

Thr Cys Ala Ala 
210 

<210> 279 
<211> 850 
<212> DNA 
<213> Oryza sativa 

<220> 

<221> CDS 

<222> (112) . . (705) 

<400> 279 

agcaagcagc agctgaggtg aggtccgtgg cgttggagtg aggactgagg aggaagaaga 60 



gggcgggatc tagggtaccg gatgcgctgg ctgtgctgag tgagagtaga g atg age 

Met Ser 



gcg 


tct 


egg 


ttc 


ate 


aag 


tgc 


gtc 


acc 


gtg 


ggg 


gac 


ggc 


gec 


1 

gtg 


ggc 


165 


Ala 


Ser 


Arg 


Phe 


He 


Lys 


Cys 


val 


Thr 


val 


Gly Asp 


Gly 


Ala 


Val 


Gly 








5 










10 










15 










aag 


acc 


tgc 


atg 


etc 


ate 


tec 


tac 


acc 


tec 


aac 


acc 


ttc 


ccc 


acg 


gac 


213 


Lys 


Thr 


Cys 


Met 


Leu 


He 


Ser 


Tyr 


Thr 


Ser 


Asn 


Thr 


Phe 


Pro 


Thr 


Asp 




20 








25 










30 












tat 


gtt 


cca 


act 


gtt 


ttt 


gat 


aac 


ttc 


agt 


gca 


aat 


gtt 


gtg 


gtc 


gat 


261 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe 


Ser 


Ala 


Asn 


Val 


Val 


Val 


Asp 




35 










40 










45 










50 


309 


ggg 


age 


act 


gtg 


aac 


ttg 


ggg 


ttg 


tgg 


gat 


aca 


gca 


gga 


caa 


gag 


gac 


Gly 


Ser 


Thr 


Val 


Asn 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 










55 










60 










65 






tac 


aat 


agg 


eta 


cgc 


ccg 


ttg 


age 


tat 


cgt 


ggc 


get 


gat 


gtt 


ttc 


ctg 


357 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Arg 


Gly Ala 


Asp 


Val 


Phe 


Leu 








70 










75 










80 








ctg 


gec 


ttt 


tct 


ctg 


ate 


age 


aaa 


gca 


age 


tat 


gag 


aat 


gtt 


tct 


aaa 


405 
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Leu 


Ala 


Phe 


Ser 


Leu 


lie 


Ser 


Lys 


Ala 


Ser 


Tyr 


Glu 


Asn 


Val 


Ser 


Lys 








85 










90 








95 








aag 


tgg 


ata 


cct 


gaa 


tta 


agg 


cat 


tat 


get 


cct 


ggt 


gtg 


cca 


ata 


att 


453 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Arg 


His 


Tyr 


Ala 


Pro 


Gly 


Val 


Pro 


He 


He 






100 










105 








110 












etc 


gtt 


gga 


aca 


aag 


ctt 


gat 


ctg 


egg 


gat 


gat 


aag 


caa 


ttt 


ttc 


gta 


501 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Asp 


Asp 


Lys 


Gin 


Phe 


Phe 


Val 




115 










120 










125 










130 




gat 


cac 


cct 


ggt 


get 


gta 


cct 


att 


tec 


act 


get 


cag 


ggc 


gaa 


gag 


ctg 


549 


Asp 


His 


Pro 


Gly 


Ala 


val 


Pro 


lie 


Ser 


Thr' 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 












135 










140 








145 






agg 


aaa 


etc 


att 


ggt 


gca 


gcg 


gca 


tac 


att 


gaa 


tgc 


agt 


tea 


aaa 


aca 


597 


Arg 


Lys 


Leu 


lie 


Gly 


Ala 


Ala 


Ala 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 


Lys 


Thr 










150 










155 










160 






cag 


caa 


aac 


ate 


aag 


gca 


gtt 


ttc 


gat 


get 


gcg 


att 


aag 


gtg 


gtt 


etc 


645 


Gin 


Gin 


Asn 
165 


He 


Lys 


Ala 


Val 


Phe 
170 


Asp 


Ala 


Ala 


He 


Lys 
175 


Val 


Val 


Leu 




cag 


cct 


cca 


aag 


caa - 


aag 


aag 


aag 


aag 


aaa 


aag 


gcg 


cag 


aaa" 


"gga 


tgt 


693 


Gin 


Pro 


Pro 


Lys 


Gin Lys 


Lys 


Lys 


Lys 


Lys 


Lys 


Ala 


Gin 


Lys 


Gly 


Cys 






180 










185 










190 








gec 


ate 


ttg 


taattaaatg gtagacagtg cagtgeagat cgatgtatcc 


cttcatttgt 


752 


Ala 


He 


Leu 






























195 



































agcctctggc ttcaatcgtc gcttgtttgt ataattaege tagatgecac eggcagaaga 812 
tataatatag tcctcctgcc tttgtggtgt tggtctct 850 



<210> 280 
<211> 197 
<212> PRT 

<213> Oryza sativa 
<400> 280 

Met Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Ash Val Val 

35 40 45 

Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 " 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 

100 105 no 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 ~ 125 

Phe Val Asp His Pro Gly Ala Val Pro He Ser Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Leu He Gly Ala Ala Ala Tyr He Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn He Lys Ala Val Phe Asp Ala Ala lie Lys Val 

165 170 175 . 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Lys Lys Ala Gin Lys 

180 185 "* 190 

Gly Cys Ala He Leu 
195 

<210> 281 
<211> 800 
<212> DNA 

<213> Suillus bovinus 



WO 2005/014828 



210/291 
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<220> 

<221> CDS 

<222> (123).. (707) 

<400> 281 ^ „ 

ccgtcacgtt gccttctacc atctttcaaa gttaaaggtt .attcaagcac tcgctgattc 60 

tgcgcacaca ttgctctcct tgagggcttc tctttcaggc aaccaagtag gattcaatca" 120 

ga atg cac aac ate aaa tgt gtt gta gtc ggc gat ggt get gtt ggc 167 
Met His Asn He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly 
15 10 15 

aag acg tgt ctt etc ate tct tat ace aca aat gee ttt cca gga gaa 215 
Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu 

20 -'25 30 

tac gtg cca aca gta ttc gac aac tac tct gca aat gtg atg gtc gac 263 
Tyr Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp 

35 40 45 

ggg aaa act ate tct etc ggt eta tgg gat ace get gga caa gaa gat 311 
Gly Lys Thr He Ser Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp 

50 55 60 

tac gat cgt etc egg cca etc tec tac cct caa aca gat gtc ttt ttg 359 
Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu 

65 "* " 70 75 

ate tgt ttc teg etc gtc agt cca cct age tac gag aac gtt aga ace 407 
He Cys Phe Ser Leu Val Ser Pro Pro Ser Tyr Glu Asn Val Arg Thr 
80 85 90 95 

aag tgg tgg cct gag att tea cat cat gcg cca tct act teg gtg gtc 455 
Lys Trp Trp Pro Glu He Ser His His Ala Pro Ser Thr Ser Val Val 

100 105 HO 

ttg gtt ggc act aaa ctg gat ttg cgt gaa gat cct get acc ate gag 503 
Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Pro Ala Thr He Glu 

115 120 125 

aaa etc cgt gac egg cgt atg cag cct ate cag tac acg caa ggg gtc 551 
Lys Leu Arg Asp Arg Arg Met Gin Pro He Gin Tyr Thr Gin Gly Val 

130 135 140 

teg atg gcg agg gat att ggc gee gtc aag tat ctt gaa tgt tct gcg 599 
Ser Met Ala Arg Asp He Gly Ala Val Lys Tyr Leu Glu Cys Ser Ala 

145 150 155 

eta teg caa aag ggc eta aag acc gtg ttt gat gag gtt ate cgt get 
Leu Ser Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Val lie Arg Ala 
160 165 170 175 

gtt ttg aac cct ccc ccg aag gaa aag aag cgc agt ggt cgt ggt tgt 695 
Val Leu Asn Pro Pro Pro Lys Glu Lys Lys Arg Ser Gly Arg Gly Cys 

180 185 190 

gtt ate gta tgagtggatt tgccattcca catgetcata tagatggatt ttctttgttt 754 
Val He Val 

tatcat'atca cgcccattta tgttgttgct tctgttaaaa aaaaaa 800 



647 



<210> 282 
<211> 194 
<212> PRT 

<213> Suillus bovinus 
<400> 282 

Met His Asn He Lys Cys Val Val 
1 5 
Thr Cys Leu Leu He Ser Tyr Thr 
20 

Val Pro Thr Val Phe Asp Asn Tyr 
35 40 



Val Gly Asp Gly Ala Val Gly Lys 

10 15 
Thr Asn Ala Phe Pro Gly Glu Tyr 
25 30 
Ser Ala Asn Val Met Val Asp Gly 
45 



WO 2005/014828 PCT/EP2004/008136 

211/291 



Lys 


Thr 


He 


Ser 


Leu 


Gly 


Leu 


Trp Asp 


Thr 


Ala Gly Gin 


Glu 


Asp 


Tyr 




50 










55 






60 






Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr Pro 


Gin 


Thr Asp Val 


Phe 


Leu 


He 


65 










70 








75 






80 


Cys 


Plie 


Ser 


Leu 


Val 


Ser 


Pro 


Pro Ser 


Tyr 


Glu Asn Val 


Arg 


Thr 


Lys 










85 








90 




95 


Trp 


Trp 


Pro 


Glu 


He 


Ser 


His 


His Ala 


Pro 


Ser Thr Ser 


Val 


Val 


Leu 








100 








105 






110 






Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg Glu 


Asp. 


Pro Ala Thr 


lie 


Glu 


L y s 






115 










120 




125 






Leu 


Arg 


Asp 


Arg 


Arg 


Met 


Gin 


Pro He 


Gin 


Tyr Thr Gin 


Gly 


Val 


Ser 




13 0 










135 






140 






Met 


Ala 


Arg 


Asp 


lie 


Gly 


Ala 


Val Lys 


Tyr 


Leu Glu Cys 


Ser 


Ala 


Leu 


145 










150 






155 






160 


Ser 


Gin 


Lys 


Gly 


Leu 


Lys 


Thr 


Val Phe 


Asp 


Glu Val He 


Arg 


Ala 


Val 










165 








170 




175 




Leu 


Asn 


Pro 


Pro 


Pro 


Lys 


Glu 


Lys Lys 


Arg 


Ser Gly Arg 


Gly 


Cys 


Val 








180 








185 




190 





He Val 



<210> 283 
<2ill> 1065 
<212> DNA 

<213> Suillus bovinus 

<220> 

<221> CDS 

<222> (170) (745) 

<400> 283 

gccaacatcg acaccgtact ttctcatttc ttctaccgac gtcctgttcg taaattagag 60 

tcaaccttct ctgtcgtgat tgggttaata gcatttcctc gcttcaactc tcctccttga 120 

attttccgga agaatcgcca agtttatcac gtagaaccat ctcaacgac atg cag act 178 

Met Gin Thr 
1 

ate aag gtt gta gtg gtg ggg gac ggt get gta ggc aag act tgt ttg 226 
He Lys Val Val Val Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu 

5 10 15 

ttg ate tea tac acc acg aac aaa ttt cca age gac tat gtt ccg act 274 
Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Asp Tyr Val Pro Thr 
20 25 30 35 

gtc ttc gat aac tac get gtc act gta atg att ggt gaa gat cct tac 322 
Val Phe Asp Asn Tyr Ala Val Thr Val Met lie Gly Glu Asp Pro Tyr 

40 45 ~ 50 

aca eta ggt ctt ttc gac acc gee ggg caa gag gat tat gac cgt etc 370 
Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu 

55 60 65 

egg cca ctg tea tac ccc caa acc gac gtc ttc ctt gtt tgc ttt agt 418 
Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val Cys Phe Ser 

70 75 80 

gta acg tec cct cca tec ttc gag aac gtg agg gag aaa tgg att ccg 466 
Val Thr Ser Pro Pro Ser Phe Glu Asn Val Arg Glu Lys Trp He Pro 

85 90. 95 

gag gta cac cat cat tgt ccg ggt gtc cca tgt ctt ate gtt ggg acg 514 
Glu Val His His His Cys Pro Gly Val Pro Cys Leu He Val Gly Thr 
100 105 110 115 

caa ate gat ttg egg gac gac get caa gtg att gag aag etc tec agg 562 
Gin He Asp Leu Arg Asp Asp Ala Gin Val He Glu Lys Leu Ser Arg 

120 125 130 

cag aag caa cga ccg gtg ccg acc gag cac ggc gag agg ctt tec aga 610 
Gin Lys Gin Arg Pro Val Pro Thr Glu His Gly Glu Arg Leu Ser Arg 

135 140 145 

gag ctt ggt gee gtc aag tac gtg gag tgc tea gee ctg acg cag aaa 658 
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Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu Thr Gin Lys 

150 155 160 

ggg ttg aag aac gtc ttc gac gag get att gtc get gcg eta gag cca 706 
Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Ala Ala Leu Glu Pro 

165 170 175 

ccc gtg gtt aag aag aca cat aaa tgc gtc gtc gtt taatgaatga 752 
Pro Val Val Lys Lys Thr His Lys Cys Val Val Val 
180 185 190 

tccatcacga eggtacaget tgttttatct tcttatgact cctttacgcc ttgttctgcc 812 
tcatatatcc ttctcatctt tttttatgtc gtttctctcg ttgcgccaag acatcatgat 872 
tttccctagt cctgttgcat ctgttctctt ccgcacgagc tgagtggact tgagctatct 932 
" cgtgctctta aagaagctga tctcatccgt ctttctacac cacaattgat gactagttcg 992 
tattgttgga actgttgggt ttgataggaa tttaataccc atgacgggga cttgtcatgt 1052 



caaaaaaaaa aaa 



1065 



<210> 284 
<211> 191 
<212> PRT 

<213> Suillus bovinus 

i 

<400> 284 ' _ 

Met Gin Thr He Lys Val Val Val Val Gly Asp Gly Ala Val Gly Lys 

X 5 10 I 5 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Asp Tyr 

20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Glu 

35 40 45 

Asp Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

Cys Phe Ser Val Thr Ser Pro Pro Ser Phe Glu Asn Val Arg Glu Lys 

85 90 95 

Trp He Pro Glu Val His His His Cys Pro Gly Val Pro Cys Leu He 

100 105 HO 

Val Gly Thr Gin He Asp Leu Arg Asp Asp Ala Gin Val He Glu Lys 

115 120 125 

Leu Ser Arg Gin Lys Gin Arg Pro Val Pro Thr Glu His Gly Glu Arg 

130 135 140 

Leu Ser Arg Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 ~* 150 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Ala Ala 

165 170 175 

Leu Glu Pro Pro Val Val Lys Lys Thr His Lys Cys Val Val Val 
180 185 190 

<210> 285 
<211> 576 
<212> DNA 

<213> Yarrowia lipolytica 



<220> 

<221> CDS 

<222> (1) (576) 



WO 2005/014828 PCT7EP2004/008136 

213/291 

<400> 285 



atg 


cag 


acc 


at a 


aaa 


tgt 


gtt gtt gtc ggc gat ggt 


gee gtc gga aag 


48 


Met 


Gin 


Thr 


lie 


Lys 


Cys 


Val Val Val Gly Asp Gly Ala Val Gly Lys 




1 








■5 




10 


15 




act: 


tgt 


etc 


etc 


ate 


tea 


tac aca aca aac aag ttc 


ccc tct gaa tac 


96 


Thr 


Cys Leu 


Leu 


lie 


Ser 


Tyr Thr Thr Asn Lys Phe 


Pro Ser Glu Tyr 










20 






25 


30 




gtt 


ccc 


acc 


gtt 


ttt 


gac 


aac tat gec gtg act gtc 


atg att gga gac 


144 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn Tyr Ala Val Thr Val Met lie Gly Asp 








35 








40 


45 




gag 


ccc 


tac 


aca 


etc 


gga 


ctg ttc gac acc gec ggt 


cag gag gat tac 


192 


Glu 


Pro 


Tyr 


Thr 


Leu 


Gly 


Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 






50 










55 60 






gac 


cga 


ctg 


cga 


cct 


ctt 


tgt tac cct cag acc gat 


gtt ttc etc gtc 


240 


Asp Arg 


Leu 


Arg 


Pro 


Leu 


Cys Tyr Pro Gin Thr Asp Val Phe Leu Val 




65 










70 


75 


80 




tgc 


ttt 


tec 


gtc 


acc 


tct 


ccc gee tec ttt gag aac 


gtc aag gag aag 


288 


Cys 


Phe 


Ser 


Val 


Thr 


Ser 


Pro Ala Ser Phe G1U Asn 


Val Lys Glu Lys 












85 




90 


95 




tgg 


ttc 


cct 


gag 


gtc 


cac 


cac cac tgc ccc ggc gtg 


cct tgc etc att 


336 


Trp 


Phe 


Pro 


Glu 


Val 


His 


His His Cys Pro Gly Val 


Pro Cys Leu lie 










100 






105 


110 




gtt 


ggt 


acc 


cag 


gtt 


gat 


ccg cga agt gac agg atg 


att ctt gac aag 


384 


Val 


Gly Thr 


Gin 


Val 


Asp 


Pro Arg Ser Asp Arg Met 


lie Leu Asp Lys 








115 








120 


125 




ctt 


tec 


cga 


cac 


aag 


ctg 


cga ccc atg acc act gag 


caa ggc tac cag 


432 


Leu 


Ser Arg 


His 


Lys 


Leu 


Arg Pro Met Thr Thr Glu 


Gin Gly Tyr Gin 






130 










135 140 




- 


etc 


gec 


cga 


gaa 


etc 


ggt 


gee gtc aag tac gtc gag 


tgt tct gee ctt 


480 


Leu 


Ala 


Arg 


Glu 


Leu 


Gly 


Ala Val Lys Tyr Val Glu 


Cys Ser Ala Leu 




145 










150 


155 


160 




act 


cag 


aag 


ggt 


etc 


aag 


gac gtt ttc gac gag gee 


ate gtg gca get 


528 


Thr Gin Lys 


Gly 


Leu 


Lys 


Asp Val Phe Asp Glu Ala 


lie Val Ala Ala 












165 




170 


175 




ctt 


gag 


cct 


cca 


gtg 


gtc 


aag aag aac aaa aag tgc 


att gtg etc 


573 


Leu Glu 


Pro 


Pro 


Val 


Val 


Lys Lys Asn Lys Lys Cys 


He Val Leu 










180 






185 


190 





tag 576 



<210> 286 
<211> 191 
<212> PRT 

<213> Yarrowia lipolytica 
<400> 286 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met lie Gly Asp 

35 40 45 

Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Cys Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Lys Glu Lys 

85 90 95 

Trp Phe Pro Glu Val His His His Cys Pro Gly Val Pro Cys Leu He 

100 105 110 

Val Gly Thr Gin Val Asp Pro Arg Ser Asp Arg Met He Leu Asp Lys 

115 120 125 

Leu Ser Arg His Lys Leu Arg Pro Met Thr Thr Glu Gin Gly Tyr Gin 

130 135 140 

Leu Ala Arg Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 150 155 160 
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Thr Gin Lys Gly Leu Lys Asp Val Phe Asp Glu Ala lie Val Ala Ala 

165 170 175 

Leu Glu Pro Pro Val Val Lys Lys Asn Lys Lys Cys He Val Leu 
180 185 190 

<210> 287 
<211> 1157 
<212> DNA 

<213> Ashbya gossypii 

<220> 

<221> CDS 

<222> (192) . . (971) 

<400> 287 

gcaagagcct taaatcacct gcatcccccg atttcaacac ccttgttaga aagaggtaaa 60 

cagat caeca cggcgccagt aggtgaaaat eggcaggcag tgggtggcgt teaggacgeg 120 

gcaaggaagg gcagtggaag caatcgeggt cggtttagcg ggacagcgcg tcaagaggca 180 

gacagaacga a atg age gca ggg ccg ttg caa get gcg cca aag aag aat 230 
Met Ser Ala Gly Pro Leu Gin Ala Ala Pro Lys Lys Asn 
1 5 10 

tac gga gca ctg ata ggc gcg ggg ccg gcg gtg ggc ggg gcg gca ttc 278 
Tyr Gly Ala Leu He Gly Ala Gly Pro Ala Val Gly Gly Ala Ala Phe 

15 20 25 

aac egg acg ctg age gag gtt gcg age tac gag cgc age egg cgc gac 326 
Asn Arg Thr Leu Ser Glu Val Ala Ser Tyr Glu Arg Ser Arg Arg Asp 
30 35 40 45 

cac gcg acg ccg gac tac egg ate aag ate gtg gtg gtg ggg gac ggc 374 
His Ala Thr Pro Asp Tyr Arg He Lys He Val Val Val Gly Asp Gly 

50 55 60 

gcg acg ggg aag acg tct ctg ctg atg teg tac aca cag ggc cag ttc 422 
Ala Thr Gly Lys Thr Ser Leu Leu Met Ser Tyr Thr Gin Gly Gin Phe 

65 70 75 

cca gag gac tac gtg ccg ace ate ttc gag aac tac gtg acg aac att 
Pro Glu Asp Tyr Val Pro Thr He Phe Glu Asn Tyr Val Thr Asn He 

80 85 90 

gag ggc cca cgc ggg aag gtg ate gag ctg gcg ctg tgg gac act get 518 
Glu Gly Pro Arg Gly Lys Val He Glu Leu Ala Leu Trp Asp Thr Ala 

95 100 105 

ggg cag gag gag tac age egg ctg egg ccg ctg tct tac ggg gac gtg 566 
Gly Gin Glu Glu Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Gly Asp Val 
110 115 120 125 

gac ate gtg atg gtg tgc tac gcg gca gac aac cgc acg teg ctg acg 614 
Asp He Val Met Val Cys Tyr Ala Ala Asp Asn Arg Thr Ser Leu Thr ■ 

130 135 140 

aac gcg gaa gag ctg tgg ttc ccg gag gtg cgc cac ttc tgc ccg cac 662 
Asn Ala Glu Glu Leu Trp Phe Pro Glu Val Arg His Phe Cys Pro His 

145 150 155 

gcg ccg atg atg ctg gtg ggg etc aag age gac ctg tac teg ctt gat 710 
Ala Pro Met Met Leu Val Gly Leu Lys Ser Asp Leu Tyr Ser Leu Asp 

160 165 170 

gcg ctg gac egg etc gtg gac ccg acg gac gcg gaa ctg gtg gcg cgc 758 
Ala Leu Asp Arg Leu Val Asp Pro Thr Asp Ala Glu Leu Val Ala Arg 

175 " 180 185 

aag atg ggc gee ttc gtg cac ctg cag tgc teg gee aag acg cga cag 806 
Lys Met Gly Ala Phe Val His Leu Gin Cys Ser Ala Lys Thr Arg Gin 
190 ^ 195 200 205 

tgc ctg gag gac gtg ttc aac acg gcg ata cac acg gcg ctg tac gac 854 
Cys Leu Glu Asp Val Phe Asn Thr Ala He His Thr Ala Leu Tyr Asp 

210 215 220 

gag ctg cgc gcg ccc ccg cag cgc ggg gtg aag ggc atg ttc aag aag 
Glu Leu Arg Ala Pro Pro Gin Arg Gly Val Lys Gly Met Phe Lys Lys 



470 



902 
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225 230 235 

aag cag cag egg gac ccg cag gcg cag tct tac aag egg gtg cgc aag 950 
Lys Gin Gin Arg Asp Pro Gin Ala Gin Ser Tyr Lys Arg Val Arg Lys 

240 245 250 

cac cgc tgt gtg gtc eta tagegcaegt gaeagecatg tttcgccggc 998 
His Arg Cys Val Val Leu 
255 

cctggttcct ggcageggag gcgcaagcgc cgcgccggcg cgcctagtcc gctgtcgagg 1058 



cctgcgtgac cgtgtagtta ggcatctagt egtagegage caagatgtta gtttaaatcg 1118 



ateggtgega cagcatcttg ttaggataga gtactggga 1157. 



<210> 288 
<211> 259 
<212> PRT 

<213> Ashbya gossypii 
<400> 288 



Met 


Ser 


Ala 


Gly 


Pro 


Leu 


Gin 


Ala 


Ala 


Pro 


Lys Lys Asn Tyr Gly Ala 


1 








5 










10 


15 


Leu 


lie 


Gly 


Ala 


Gly 


Pro 


Ala 


Val 


Gly 


Gly 


Ala Ala Phe Asn Arg Thr 








20 










25 




30 


Leu 


Ser 


Glu 


Val 


Ala 


Ser 


Tyr 


Glu 


Arg 


Ser 


Arg Arg Asp His Ala Thr 






35 










40 






45 


Pro 


Asp 


Tyr 


Arg 


He 


Lys 


He 


Val 


Val 


Val 


Gly Asp Gly Ala Thr Gly 




50 










55 








60 


Lys 


Thr 


Ser 


Leu 


Leu 


Met 


Ser 


Tyr 


Thr 


Gin 


Gly Gin Phe Pro Glu Asp 


65 










70 










75 80 


Tyr 


Val 


Pro 


Thr 


lie 


Phe 


Glu 


Asn 


Tyr 


Val 


Thr Asn He Glu Gly Pro 










85 










90 


95 


Arg 


Gly 


Lys 


Val 


He 


Glu 


Leu 


Ala 


Leu 


Trp 


Asp Thr Ala Gly Gin Glu 








100 










105 




110 


Glu 


Tyr 


Ser 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Gly Asp Val Asp He Val 
















120 






125 


Met 


Val 


Cys 


Tyr 


Ala 


Ala 


Asp 


Asn 


Arg 


Thr 


Ser Leu Thr Asn Ala Glu 




130 










135 








140 


Glu 


Leu 


Trp 


Phe 


Pro 


Glu 


Val 


Arg 


His 


Phe 


Cys Pro His Ala Pro Met 


145 










150 










155 160 


Met 


Leu 


Val 


Gly 


Leu 


Lys 


Ser 


Asp 


Leu 


Tyr 


Ser Leu Asp Ala Leu Asp 










165 










170 


175 


Arg 


Leu 


Val 


Asp 


Pro 


Thr 


Asp 


Ala 


Glu 


Leu 


Val Ala Arg Lys Met Gly 








180 










185 




190 


Ala 


Phe 


Val 


His 


Leu 


Gin 


Cys 


Ser 


Ala 


Lys 


Thr Arg Gin Cys Leu Glu 






195 










200 






205 


Asp 


Val 


Phe 


Ash 


Thr 


Ala 


He 


His 


Thr 


Ala 


Leu Tyr Asp Glu Leu Arg 




210 










215 








220 


Ala 


Pro 


Pro 


Gin 


Arg 


Gly 


Val 


Lys 


Gly 


Met 


Phe Lys Lys Lys Gin Gin 


225 










230 










235 240 


Arg 


Asp 


Pro 


Gin 


Ala 


Gin 


Ser 


Tyr 


Lys 


Arg 


Val Arg Lys His Arg Cys 










245 










250 


255 


Val 


Val 


Leu 



















<210> 289 

<211> 1067 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> CDS 

<222> (131) . . (775) 
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<400> 289 

aagaacgaac aactcttctc cccctctccc cctccttcct tgctcctcct cccatcttgt 



60 



ctcctccctc tctctcgcaa agtcgcaacg cgctcctcct cggtgaaacg ggagtgaggt 120 

gcgtgcggcc atg gcg tec age gcc tec egg ttc ate aag tgc gtc acg 169 
Met Ala Ser Ser Ala Ser Arg Phe lie Lys Cys Val Thr 
1 5 10 

gtg ggc gac ggc gcc gtg ggc aag acc tgc atg etc ate tgc tac acc 217 
Val Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr 

15 20 25 

age aac aag ttc ccc act gac tac ata cca acg gtg ttc gac aat ttc 265 
Ser Asn Lys Phe Pro Thr Asp Tyr lie Pro Thr Val Phe Asp Asn Phe 
30 * 35 40 45 

age gca aac gtt gtt gtg gac age acc acg gtg aat ctg ggc etc tgg 313 

Ser Ala Asn Val Val Val Asp Ser Thr Thr Val Asn Leu Gly Leu Trp 

50 55 60 

gat act get ggg caa gag gat tac aac egg etc agg cct ctg age tat 361 
Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr 

65 70 75 

cgt ggt get gat gtt ttc gtg ctt gcc ttc tct ctt- gtg age cga get 409 
Arg Gly Ala Asp Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala 

80 85 90 

age tat gaa aat att atg aag aag tgg ata ccg gag eta cag cat tat 457 
Ser Tyr Glu Asn lie Met Lys Lys Trp lie Pro Glu Leu Gin His Tyr 

95 100 105 

gca ccc ggc gtg ccc att gtg .ttg gtt ggc aca aaa ttg gat etc cgt 505 
Ala Pro Gly Val Pro lie Val Leu Val Gly Thr Lys Leu Asp Leu Arg 
110 115 120 125 

gaa gac aag cac tac ttg ttg gac cat cct ggc atg ata cct gtt acc 553 
Glu Asp Lys His Tyr Leu Leu Asp His Pro Gly Met lie Pro Val Thr 

130 135 140 

aca gca cag ggt gaa gaa ctt cga aaa caa ata ggt get get tat tac 
Thr Ala Gin Gly Glu Glu Leu Arg Lys Gin lie Gly Ala Ala Tyr Tyr 

145 150 155 

att gag tgc age tea aag aca caa cag aat gtc aaa ggt gtg ttt gat 649 
lie Glu Cys Ser Ser Lys Thr Gin Gin Asn Val Lys Gly Val Phe Asp 

160 165 170 

get get ate aag gta gta ate cag cct cca act aag cag agg gaa aag 697 
Ala Ala He Lys Val Val He Gin Pro Pro Thr Lys Gin Arg Glu Lys 

175 180 185 

aag aaa aag aaa tea cga caa gga tgc tct atg atg aac atg ttc cgt 745 
Lys Lys Lys Lys Ser Arg Gin Gly Cys Ser Met Met Asn Met Phe Arg 
190 ^ 195 200 205 

gga agg aaa atg tea tgc ttc aaa tec tgattgatcg agatgtccct 792 
Gly Arg Lys Met Ser Cys Phe Lys Ser 
210 

tacatgatgc aatgttctgg gtgcaacctc aagctggcaa cctttggagt ctgaatgtgc 852 
tggtagagtt agatgeaaag ttcttgtgta tatttgeate ttttggtaat taattactag 912 
agttagatgg ttaaactagt ctgcttatgt accgcacaat gcttnttatt attagctctt 972 
gttggtctag aaagtctgat agatgecaaa ggaacaagga gacaccgttc tgtcaggaca 1032 
aaatcacaga ttttgtaatc aaaaaaaaaa aaaaa 1067 



601 



<210> 290 
<211> 214 
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<212> PRT 

<213> Oryza sativa 
. <400> 290 



Met 


Ala 


Ser 


Ser 


Ala 


Ser Arg Phe He Lys Cys Val 


Thr Val Gly 


Asp 


1 








5 


■ 10 


15 




Gly 


Ala 


Val Gly 


Lys 


Thr Cys Met Leu. He Cys Tyr 


Thr Ser Asn 


Lys 








20 




25 


30 


Phe 


Pro 


Thr Asp 


Tyr 


He Pro Thr Val Phe Asp Asn 


Phe Ser Ala 


Asn 






35 






40 


45 




Val 


Val 


val 


Asp 


Ser 


Thr Thr Val Asn Leu Gly Leu 


Trp Asp Thr 


Ala 




50 . 








55 60 






Gly 


Gin 


Glu Asp 


Tyr 


Asn Arg Leu Arg Pro Leu Ser 


Tyr Arg Gly 


Ala 


65 










70 ~ 75 




80 


Asp 


Val 


Phe 


Val 


Leu 


Ala Phe Ser Leu Val Ser Arg 


Ala Ser Tyr 


Glu 










85 


90 


95 




Asn 


lie 


Met 


Lys 


Lys 


Trp He Pro Glu Leu Gin His 


Tyr Ala Pro 


Gly 








100 




105 


110 




Val 


Pro 


lie 


Val 


Leu 


Val Gly Thr Lys Leu Asp Leu 


Arg Glu Asp 


Lys 






115 






120 


125 


His 


Tyr 


Leu 


Leu 


Asp 


His Pro Gly Met He Pro Val 


Thr Thr Ala 


Gin 




130 








135 140 






Gly 


Glu 


Glu 


Leu 


Arg 


Lys Gin He Gly Ala Ala Tyr 


Tyr He Glu 


Cys 


145 










150 155 




160 


Ser 


Ser 


Lys 


Thr 


Gin 


Gin Asn Val Lys Gly Val Phe 


Asp Ala Ala 


lie 










165 


170 


175 




Lys 


Val 


Val 


He 


Gin 


Pro Pro Thr Lys Gin Arg Glu 


Lys Lys Lys 


Lys 








180 




185 


190 




Lys 


Ser 


Arg 


Gin 


Gly 


Cys Ser Met Met Asn Met Phe 


Arg Gly. Arg 


Lys 






195 






200 


205 




Met 


Ser 


Cys 


Phe 


Lys 


Ser 








210 















<210> 291 
<211> 1087 
<212> DNA 

<213> Oryza sativa 

<220> 

<221> CDS 

<222> (141) . . (785) 

<400> 291 

aacaaagcag cttgccaccg tccctgcctc tcctgctttg ctttgccttc tcggagaaga 60 



caaatctact gtagctgctt gtctctgttc gccggtggcg accgtgagag gaggaggtag 12 0 



tcctagtcct agtaggcgag atg age ggc gee acc aag ttc ate aag tgc gtc 173 

Met Ser Gly Ala Thr Lys Phe lie Lys Cys Val 
1 5 10 



acc 


gtc 


ggc 


gac 


ggc 


gee 


gtc 


ggc 


aag 


acc 


tgc 


atg 


etc 


ate 


tgc 


tac 


221 


Thr 


Val 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


lie 


Cys 


Tyr 










15 










20 










25 








acc 


age 


aac 


aag 


ttc 


ccc 


acc 


gat 


tac 


ate 


ccc 


acc 


gtg 


ttc 


gac 


aac 


269 


Thr 


Ser 


Asn 


Lys 


Phe 


Pro 


Thr 


Asp 


Tyr 


lie 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 








30 










35 










40 










ttc 


agt 


get 


aat 


gtt 


tea 


gtg 


gat 


ggg 


aac 


ate 


gtc 


aac 


ttg 


gga 


tta 


317 


Phe 


Ser 


Ala 


Asn 


Val 


Ser 


Val 


Asp 


Gly 


Asn 


He 


Val 


Asn 


Leu 


Gly 


Leu 






45 










50 










55 












tgg 


gac 


act 


get 


gga 


caa 


gag 


gat 


tac 


age 


agg 


ctg 


agg 


cca 


ctg 


age 


365 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Ser 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 




60 










65 










70 










75 




tac 


agg 


gga 


gcg 


gat 


ata 


ttt 


gtg 


ctg 


gca 


ttc 


tea 


ctg 


ate 


age 


aga 


413 


Tyr 


Arg 


Gly 


Ala 


Asp 


He 


Phe 


Val 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Arg 












80 










85 










90 




gca 


age 


tat 


gag 


aat 


gtt 


etc 


aag 


aag 


tgg 


atg 


ccg 


gag 


ctt 


cgc 


egg 


461 
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Ala 


Ser 


Tyr 


Glu 
95 


Asn 


Val 


Leu 


Lys 


Lys 
100 


Trp 


Met 


Pro 


Glu 


Leu 
105 


Arg 


Arg 




ttc 


gca 


ccg 


aat 


gtt 


cca 


att 


gtt 


ctt 


gtt 


ggg 


ace 


aag 


tta 


gat 


eta 


509 


Phe 


Ala 


Pro 


Asn 


Val 


Pro 


He 


Val 


Leu 


Val 


Gly Thr 


Lys 


Leu 


Asp 


Leu 








110 










115 










120 










cgt 


gac 


cac 


aga 


tct 


tac 


ctt 


gcg 


gac 


cat 


cct 


get 


get 


tec 


gca 


att 


557 


Arg 


Asp 


His 


Arg 


Ser 


Tyr 


Leu 


Ala 


Asp 


His 


Pro .Ala 


Ala 


Ser 


Ala 


He 




125 










130 










135 












acg 


act 


gca 


cag 


ggt 


gaa 


gaa 


ctt 


aga 


aag 


cag 


ata 


ggc 


gee 


gcg 


gec 


605 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 


Arg 


Lys 


Gin 


He 


Gly 


Ala 


Ala 


Ala 




140 










145 










150 










155 




tac 


ate 


gaa 


tgc 


agt 


teg 


aaa 


aca 


caa 


cag 


aac 


ate 


aag 


gee 


gtg 


ttt 


653 


Tyr 


He 


Glu 


Cys 


Ser 


Ser 


Lys 


Thr 


Gin 


Gin 


Asn 


He 


Lys 


Ala 


Val 


Phe 










160 










165 










170 






gat 


act 


gec 


ate 


aag 


gtg 


gtc 


ctt 


cag 


cct 


cct 


egg 


aga 


agg 


ggg 


gag 


701 


Asp 


Thx 


Ala 


He 


Lys 


Val 


Val 


Leu 


Gin 


Pro 


Pro 


Arg 


Arg 


Arg 


Gly 


Glu 








175 










180 










185 








acg- 


acg 


atg 


gca 


agg- 


"aag 


aag 


aca 


agg 


cga 


age 


ace 


ggc 


tgc 


teg 


tta 


* 749 


Thr 


Thr 


Met 


Ala 


Arg Lys 


Lys 


Thr 


Arg 


Arg 


Ser 


Thr 


Gly 


Cys 


Ser 


Leu 








190 










195 










200 










aag 


aac 


ttg 


atg 


tgt 


ggc 


agt 


gca 


tgt 


gtt 


gtt 


taggacctgt ctgaattctt 


802 


Lys 


Asn 


Leu 


Met 


Cys Gly 


Ser 


Ala 


Cys 


Val 


Val 















205 210 
gaaggtgtgt aagcgttgtg taatggcaag tgaatttget gactgttctg ctgcttccafc 862 



ttactactac taagctcttt cgtgtgtttg aattactaca tctaaaacta gagcatcagg 922 

ctcatagact gacttcatat caggtgtttg tatcattaat attattatgt aagaaggttg 982 

tggtttgtgg aacaactaaa tacaacaatg caggctatgt tgtactatgc tetgetactg 1042 

gtgtttgtta taaatatcac atectgetaa aaaaaaaaaa aaaaa 1087 



<210> 292 
<211> 214 
<212> PRT 
<213> Oryza sativa 



<400> 292 



Met 


Ser 


Gly 


Ala 


Thr 


Lys 


Phe 


He 


Lys 


Cys 


Val Thr 


Val Gly Asp Gly 


1 






5 








10 




15 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


He 


Cys 


Tyr Thr 


Ser Asn Lys Phe 






20 










25 






30 


Pro 


Thr 


Asp 


Tyr 


He 


Pro 


Thr 


Val 


Phe 


Asp 


Asn Phe 


Ser Ala Asn Val 






35 








40 








45 


Ser 


Val 


Asp 


Gly 


Asn 


He 


Val 


Asn 


Leu 


Gly 


Leu Trp 


Asp Thr Ala Gly 




50 










55 








60 




Gin 


Glu 


Asp 


Tyr 


Ser 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser Tyr 


Arg Gly Ala Asp 


65 






70 










75 


80 


He 


Phe 


Val 


Leu 


Ala 


Phe 


Ser 


Leu 


He 


Ser 


Arg Ala 


Ser Tyr Glu Asn 










85 










90 




95 


Val 


Leu 


Lys 


Lys 


Trp 


Met 


Pro 


Glu 


Leu 


Arg 


Arg Phe 


Ala Pro Asn Val 








100 










105 






110 


Pro 


He 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu Arg 


Asp His Arg Ser 






115 










120 








125 


Tyr 


Leu 


Ala 


Asp 


His 


Pro 


Ala 


Ala 


Ser 


Ala 


He Thr 


Thr Ala Gin Gly 


130 








13 5 








140 




Glu 


Glu 


Leu 


Arg 


Lys 


Gin 


He 


Gly 


Ala 


Ala 


Ala Tyr 


He Glu Cys Ser 


145 










150 










155 


160 


Ser 


Lys 


Thr 


Gin 


Gin 


Asn 


He 


Lys 


Ala 


Val 


Phe Asp 


Thr Ala He Lys 








165 










170 




175 
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Val Val Leu Gin Pro Pro Arg Arg Arg Gly Glu Thr Thr Met Ala Arg 

180 185 ~ 190 

Lys Lys Thr Arg Arg Ser Thr Gly Cys Ser Leu Lys Asn Leu Met Cys 

195 200 205 

Gly Ser Ala Cys Val Val 
210 

<210> 293 

<211> 1059 

<212> DNA 

<2l3> Cicer arietinura 

<220> 

<221> CDS 

<222> (118) . . (711) 

<400> 293 

cttcttcatc ccttcttctt gatcattcat tcgttcattc accaagttcg "aacttgaacg 60 

cacaacagca ttaccacatg agtaattagg gtttatacag caacaagaaa cataaaa 117 

atg agt ggg tec agg ttc att aag tgt gtt acc gta ggt gac ggt gec 165 

Met Ser Gly Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 .10 15 

gta ggc aag act tgt ttg ctt ate tec 'tac acc age aac act ttc cct 213 

Val Gly Lys Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

acg gac tat gtg cct act gtc ttt gat aat ttc agt gca aat gta gtt 261 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

gtg gat ggg age aet ata aat ctg ggt ttg tgg gat act get ggc caa 309 
Val Asp Gly Ser Thr lie Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

gaa gat tac aat aga tta aga ccc tta age tat cgt gga gcg gat gtt 357 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

ttc ctg ctt get ttc tea etc ata age agg gee age tat gaa aat ate 405 
Phe Leu Leu Ala Phe Ser Leu lie Ser Arg Ala Ser Tyr Glu Asn lie 

85 90 95 

gec aag aaa tgg att cct gag ttg agg cat tat get cct ggt gtt cca 453 
Ala Lys Lys Trp lie Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 

100 105 110 

att att etc gtt gga aca aaa ctt gat ctt egg gat gat aag caa ttc 501 
lie lie Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

ttt caa gac cat cct ggt gcg gtg cca ate acc aca gca cag ggt gag 549 
Phe Gin Asp His Pro Gly Ala Val Pro lie Thr Thr Ala Gin Gly Glu 

130 ~ 135 140 

gaa ctg aga aag ctt att ggt get cca gtt tac att gaa tgt agt tec 597 
Glu Leu Arg Lys Leu He Gly Ala Pro Val Tyr He Glu Cys Ser Ser 
145 150 155 160 

aaa aca caa aag aat gtg aag get gtt ttt gat gcg gee ate aaa gta 645 
Lys Thr Gin Lys Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

gtt eta cag ccg cca aag caa aag aaa act aag aga aag ggg caa aaa 693 
Val Leu Gin Pro Pro Lys Gin Lys Lys Thr Lys Arg Lys Gly Gin Lys 

180. 185 190 

gee tgt tec att ttg tgatcttcag ttcttcatat tgtagttggt ggagaggacg 748 
Ala Cys Ser He Leu 
195 

acatgatagt aaccatttcc t ggt agt ctt tgccattctg attctatatc ttaccctatt 808 



ttcaacctac atggaggaat cgacgtgtcc aactgagtcc taactacata aggtttgett 



868 
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ataagcatca agctatttgt ctctggctga aggatatatg taaagtcata atttcaaaca 928 
tgtttggtgt gtagaacaac tcttgtgttt ctctgggatg tttttacctg ttgtattgat 988 
tattaaatga ccatctccaa ttgtggacat. gggttttact cacattgtaa tctaatctag 1048 
gaagtgcttg c 1059 



<210> 294 
<211> 197 
<212> PRT 

<213> Cicer arietinunr 
<400> 294 

Met Ser Gly Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asp Gly Ser Thr He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Arg Ala Ser Tyr Glu Asn He 

85 90 95 

Ala Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly Val Pro 

100 105 HO 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe Gin Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Leu He Gly Ala Pro Val Tyr He Glu Cys Ser Ser 
145 " 150 155 160 

Lvs Thr Gin Lys Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Thr Lys Arg Lys Gly Gin Lys 

180 185 190 

Ala Cys Ser He Leu 
195 

<210> 295 
<211> 805 
<212> DNA 

<213> Wuchereria bancrofti 

<220> 

<221> CDS 

<222> (26) • . (601) 

<400> 295 

cggcacgagc aagtttgagg taaag atg cag act ate aaa tgt gta gtg gtt 52 
~ " " Met Gin Thr He Lys Cys Val Val Val 

1 5 

gga gat ggt get gtc ggc aaa aca tgc ctt ctt att teg tat act acc 100 

Glv Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr 

10 15 20 25 

aac aag ttt ccc agt gaa tat gta ccc act gta ttt gac aat tat get 148 

Asn Lys Phe Pro Ser Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ala 

30 35 40 

gta aca gta atg att gga gga gaa cca tat acg ctt ggt ttg ttc gat 196 
Val Thr Val Met He Gly Gly Glu Pro Tyr Thr Leu Gly Leu Phe Asp 
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45 










50 










55 








aca 


gcc 


ggt 


caa 


gag 


gat 


tat 


gat 


aga 


ttg 


cgt 


cct 


ctt 


tea 


tat 


ccg 


244 


Thr 


Ala 


Gly 
60 


Gin 


Glu 


Asp 


Tyr 


Asp 
65 


Arg 


Leu 


Arg 


Pro 


Leu 
70 


Ser 


Tyr 


Pro 




caa 


aca 


gat 


gta 


ttt 


tta 


gtg 


tgc 


ttt 


tct 


gtc 


gtt 


gca 


cct 


tea 


tct 


292 


Gin 


Thr 
75 


Asp 


val 


Phe 


Leu 


Val 
80 


Cys 


Phe 


Ser 


Val 


Val 
85 


Ala 


Pro 


Ser 


Ser 




ttc 


gaa 


aat 


gtg 


aaa 


gag 


aag 


tgg 


gtg 


ccg 


gag 


ata 


gca 


cat 


cac 


tgc 


340 


Phe 


Glu 


Asn 


Val 


Lys 


Glu 


Lys 


Trp 


Val 


Pro 


Glu 


He 


Ala 


His 


His 


Cys 




90 










95 










100 










105 




atg 


aag 


aca 


ccg 


ttc 


ctg 


ctg 


gtc 


gga 


act 


cag 


att 


gat 


ctt 


cgt 


gac 


388 


Met 


Lys 


Thr 


Pro 


Phe 
110 


Leu 


Leu 


val 


Gly 


Thr 
115 


Gin 


He 


Asp 


Leu 


Arg 
120 


Asp 




gat 


cct 


tec 


tac 


ate 


gaa 


aaa 


ttg 


gca 


aaa 


ate 


aag 


caa 


cga 


cca 


att 


436 


Asp 


Pro 


Ser 


Tyr 
125 


lie 


Glu 


Lys 


Leu 


Ala 
13 0 


Lys 


He 


Lys 


Gin 


Arg 
135 


Pro 


lie 




aca 


ttc 


gaa 


gtt 


gga 


gag 


aag 


tta 


gcg 


aaa 


gaa 


tta 


aag 


gca 


gtg 


aaa 


484 


Thr 


Phe 


Glu 
140 


Val 


Gly 


Glu 


Lys 


Leu 
145 


Ala 


Lys" 


Glu 


Leu 


Lys 
150 


Ala 


Val 


Lys 




tac 


gtc 


gaa 


tgt 


tct 


gca 


etc 


aca 


cag 


aaa 


ggt 


eta 


aaa 


aat 


gta 


ttt 


532 


Tyr 


Val 


Glu 


Cys 


Ser 


Ala 


Leu 


Thr 


Gin 


Lys Gly 


Leu 


Lys 


Asn 


Val 


Phe 






155 










160 










165 












gat 


gaa 


gca 


ata 


ctg 


gca 


gca 


ttg 


gaa 


cca 


ccg 


gca 


cag 


gag 


aaa 


aag 


580 


Asp 


Glu 


Ala 


lie 


Leu 


Ala 


Ala 


Leu 


Glu 


Pro 


Pro 


Ala 


Gin 


Glu 


Lys 


Lys 




170 










175 










180 










185 




aag 


aaa 


tgt 


act 


ata 


Ctt 


tagcatcatt gtttctcgat caegttatat 




628 


Lys 


Lys 


Cys 


Thr 


He 

t on 


Leu 

























190 



tgatttctga tttgttagtc atttatctgt aaatgattat gttaaegtet taaatgctgc 688 



acataccatt actacgtccc ctcttctggt tggtccatct gcgcatatgt gtagtgtttc 748 



catctatgeg atatacgtgt agaatgaatg tattcctgta caaaaaaaaa aaaaaaa 805 



<210> 296 
<211> 191 
<212> PRT 

<213> Wuchereria bancrofti 



<400> 296 



Met 


Gin 


Thr 


He 


Lys 


Cys 


Val 


Val 


Val 


Gly 


Asp 


Gly 


Ala 


Val 


Gly 


Lys 


1 








5 










10 










15 




Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr 


Thr 


Asn 


Lys 


Phe 


Pro 


Ser 


Glu 


Tyr 






20 










25 










30 






Val 


Pro 


Thr 
35 


Val 


Phe 


Asp 


Asn 


Tyr 
40 


Ala 


Val 


Thr 


Val 


Met 
45 


He 


Gly 


Gly 


Glu 


Pro 
50 


Tyr 


Thr 


Leu 


Gly 


Leu 
55 


Phe 


Asp. 


Thr 


Ala 


Gly 
60 


Gin 


Glu 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Gin 


Thr 


Asp 


Val 


Phe 


Leu 


Val 


65 










70 










75 










80 


Cys 


Phe 


Ser 


Val 


Val 


Ala 


Pro 


Ser 


Ser 


Phe 


Glu 


Asn 


Val 


Lys 


Glu 


Lys 








85 










90 










95 




Trp 


Val 


Pro 


Glu 
100 


He 


Ala 


His 


His 


Cys 
105 


Met 


Lys 


Thr 


Pro 


Phe 
110 


Leu 


lieu 


Val 


Gly 


Thr 
115 


Gin 


He 


Asp 


Leu 


Arg 
120 


Asp 


Asp 


Pro 


Ser 


Tyr 
125 


He 


Glu 


Lys 


Leu 


Ala 
130 


Lys 


He 


Lys 


Gin 


Arg 
135 


Pro 


He 


Thr 


Phe 


Glu 
140 


Val 


Gly 


Glu 


Lys 


Leu 


Ala 


Lys 


Glu 


Leu 


Lys 


Ala 


Val 


Lys 


Tyr 


Val 


Glu 


Cys 


Ser 


Ala 


Leu 


145 










150 










155 










160 


Thr 


Gin 


Lys 


Gly 


Leu 


Lys 


Asn 


Val 


Phe 


Asp 


Glu 


Ala 


lie 


Leu 


Ala 


Ala 






165 










170 










175 




Leu 


Glu 


Pro 


Pro 


Ala 


Gin 


Glu 


Lys 


Lys 


Lys 


Lys 


Cys 


Thr 


He 


Leu 
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180 185 190 

<210> 297 
<211> 579 
<212> DNA 

<213> Emericella nidulans 

<220> 

<221> CDS 

<222> (1) . . (579) 

<400> 297 

atg gtc gta gca act ate aag tgt gtt gtg gtc ggt gat ggt get gtc 48 
Met Val Val Ala Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val 
1.5 10 15 
ggg aag acc tgt etc ctg ate tea tac aca acg aac aaa tte cct tea 96 
Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser 
20 25 30 

. gaa tat gtg ccc acc gtg tte gat aac tat get gtg act gtc atg ate 144 
Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He 

35 40 45 

ggc gat gag ccc tac act eta gga tta ttt gat act get ggt caa gaa 192 
Gly Asp Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu 

50 " 55 60 

gat tat gac cga ctt cgt cct etc tea tac ccg cag aca gat gta tte 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe 
65 70 75 80 

ctt gtc tgc ttt tct gtg acc tec ccc gca tct tte gag aac gtg cgc 288 
Leu Val Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Arg 

85 90 95 

gaa aag tgg tte ccc gaa gtt cac cac cac tgc ccc ggc gtc cca tgc 336 
Glu Lys Trp Phe Pro Glu Val His His His Cys Pro Gly Val Pro Cys 

100 ' 105 HO 

ctg att gtt ggg aca caa gtc gat etc egg gat gac ccc gee gtt cgc 384 
Leu He Val Gly Thr Gin Val Asp Leu Arg Asp Asp Pro Ala Val Arg 

115 120 125 

gac aag ctg get egg caa aag atg caa ccg att cgc aaa gaa gac ggt 432 
Asp Lys Leu Ala Arg Gin Lys Met Gin Pro He Arg Lys Glu Asp Gly 

130 " 135 140 

gac cgc atg gee aag gat ctg ggc gee gtg aaa tat gtc gaa tgc tec 480 
Asp Arg Met Ala Lys Asp Leu Gly Ala Val Lys Tyr Val Glu Cys Ser 
145 150 155 160 

gca ttg acc caa tac aag etc aag gat gtc ttt gac gag get att gtt 528 
Ala Leu Thr Gin Tyr Lys Leu Lys Asp Val Phe Asp Glu Ala He Val 

165 170 175 

gcg gca etc gaa ccg gee cct aag aag agg teg agg tgc gtt ctg eta 576 
Ala Ala Leu Glu Pro Ala Pro Lys Lys Arg Ser Arg Cys Val Leu Leu 
180 185 190 

tag 



579 



<210> 298 
<211> 192 
<212> PRT 

<213> Emericella nidulans 
<400> 298 

Met Val Val Ala Thr He Lys Cys 
1 5 
Gly Lys Thr Cys Leu Leu He Ser 
20 

Glu Tyr Val Pro Thr Val Phe Asp 

35 40 
Gly Asp Glu Pro Tyr Thr Leu Gly 

50 55 
Asp Tyr Asp Arg Leu Arg Pro Leu 



Val Val Val Gly Asp Gly Ala Val 

10 15 
Tyr Thr Thr Asn Lys Phe Pro Ser 
25 30 
Asn Tyr Ala Val Thr Val Met He 
45 

Leu Phe Asp Thr Ala Gly Gin Glu 
60 

Ser Tyr Pro Gin Thr Asp Val Phe 



WO 2005/014828 PCT/EP2004/008136 

223/291 



65 










70 




75 


80 


Leu 


val 


Cys 


Phe 


Ser 


Val 


Thr 


Ser Pro Ala Ser Phe 


Glu Asn Val Arg 










85 






90 


95 


Glu 


Lys 


Trp 


Phe 


Pro 


Glu 


Val 


His His His Cys Pro 


Gly Val Pro Cys 








100 








105 


110 


Leu 


He 


Val 


Gly 


Thr 


Gin 


Val 


Asp Leu Arg Asp Asp 


Pro Ala Val Arg 






115 










120 


125 


Asp 


Lys 


Leu 


Ala 


Arg 


Gin 


Lys 


Met Gin Pro He Arg 


Lys Glu Asp Gly 




130 










135 


140 


Asp 


Arg 


Met 


Ala 


Lys 


Asp 


Leu 


Gly Ala Val Lys Tyr 


Val Glu Cys Ser 


145 










150 




155 


160 


Ala 


Leu 


Thr 


Gin 


Tyr 


Lys 


Leu 


Lys Asp Val Phe Asp 


Glu Ala lie Val 










165 






170 


175 


Ala 


Ala 


Leu 


Glu 


Pro 


Ala 


Pro 


Lys Lys Arg Ser Arg 


Cys Val Leu Leu 








180 








185 


190 



<210> 299 
<211> 588 
<212> DNA 

<213> Drosophila melanogaster 

<220> 

<221> CDS 

<222> (1) . . (588) 

<400> 299 

atg tea acc gga agg ccc ata aag tgt gta gtc gtt ggt gac ggc acc 48 

Met Ser Thr Gly Arg Pro He Lys Cys Val Val Val Gly Asp Gly Thr 

1 5 10 15 

gtc gga aag acc tgc atg eta ate tec tac acg aca gac tgc ttt ccc 96 

Val Gly Lys Thr Cys Met Leu He Ser. Tyr Thr Thr Asp Cys Phe Pro 

20 25 30 

ggc gaa tat gtg ccc aca gtc ttc gac aac tac teg gcg ccc atg caa 144 
Gly Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Pro Met Gin 

35 40 45 

gtg gac aca ata cag gtc teg ctg gga ctg tgg gat acg gcg ggt cag 192 
Val Asp Thr lie Gin Val Ser Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

gag gac tac gac cgc ctg aga ccg eta tec tac ccg cag aca gac gtt 240 
Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val 
65 70 75 80 

ttc ctg ata tgc tac age gtg gcg agt ccc teg tec ttt gag aac gtc 288 
Phe Leu He Cys Tyr Ser Val Ala Ser Pro Ser Ser Phe Glu Asn Val 

85 90 95 

acc teg aaa tgg tat ccg gag ata aag cac cac tgt ccc gat gcg ccc 336 
Thr Ser Lys Trp Tyr Pro Glu He Lys His His Cys Pro Asp Ala Pro 

100 105 110 

ate att eta gtt ggc acc aaa ate gat ttg cgc gaa gat cga gag aca 31 84 

He He Leu Val Gly Thr Lys He Asp Leu Arg Glu Asp Arg Glu Thr 

115 120 125* 

etc age ggc ctg gca gag cag gga ctg acg ccg ctg aag cgc gag cag 432 
Leu Ser Gly Leu Ala Glu Gin Gly Leu Thr Pro Leu Lys Arg Glu Gin 

130 135 140 

ggc cag aag ctg gca aac aag ata cgc get gtg aaa tac atg gag tgc 480 
Gly Gin Lys Leu Ala Asn Lys He Arg Ala Val Lys Tyr Met Glu Cys 
145 150 155 160 

tec gee ttg acg cag cgc ggt etc aag ccg gtg ttc gag gaa gcg gtg 528 
Ser Ala Leu Thr Gin Arg Gly Leu Lys Pro Val Phe Glu Glu Ala Val 

165 170 175 

cgc gcg gtg etc aga cca gag ccg eta aag cga cgc cag cga aag tgt 576 
Arg Ala Val Leu Arg Pro Glu Pro Leu Lys Arg Arg Gin Arg Lys Cys 

180 185 " 190 

tta ata atg taa 588 
Leu He Met 

195 



<210> 300 
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<211> 195 
<212> PRT 

<213> Drosophila melanogaster 
<400> 300 

Met Ser Thr Gly Arg Pro lie Lys Cys Val Val Val Gly Asp Gly Thr 

x 5 10 15 

Val Gly Lys Thr Cys Met Leu lie Ser Tyr Thr Thr Asp Cys Phe Pro 

20 25 30 

Gly Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Pro Met Gin 

35 40 45 

Val Asp Thr lie Gin Val Ser Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val 
65 70 75 80 

Phe Leu He Cys Tyr Ser Val Ala Ser Pro Ser Ser Phe Glu Asn Val 
85 90 95 

* Thr' Ser Lys Trp tyr" Pro "Glu lie Lys His His Cys Pro Asp Ala Pro 
100 105 HO 

He lie Leu Val Gly Thr Lys He Asp Leu Arg Glu Asp Arg Glu Thr 

115 120 125 

Leu Ser Gly Leu Ala Glu Gin Gly Leu Thr Pro Leu Lys Arg Glu Gin 

130 135 140 

Gly Gin Lys Leu Ala Asn Lys He Arg Ala Val Lys Tyr Met Glu Cys 
145 150 155 160 

Ser Ala Leu Thr Gin Arg Gly Leu Lys Pro Val Phe Glu Glu Ala Val 

165 170 175 

Arg Ala Val Leu Arg Pro Glu Pro Leu Lys Arg Arg Gin Arg Lys Cys 
180 185 190 

Leu He Met 
195 

<210> 301 
<211> 582 
<212> DNA 

<213> Xenopus laevis 

<220> 

<221> CDS 

<222> (1) . . (582) 

<400> 301 

atg gca gcc att cgt aag aag etc gta att gtt gga gat ggt gca tgc 48 
Met Ala Ala He Arg Lys Lys Leu Val He Val Gly Asp Gly Ala Cys 
15 10 15 

ggg aaa acc tgc ctt ctg att gtg ttc agt aaa gac cag ttt cca gaa 96 
Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 30 

gtg tat gtc cca aca gtt ttt gag aac tat gtg gca gac ata gaa gtg 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

gat ggc aag cag gtt gag ttg gcc eta tgg gat aca get ggt caa gaa 192 
Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tat gac aga ctg cga cct ctg tec tat cca gac act gat gtt ata 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 A ~ 70 75 80 

tta atg tgc ttt tec att gat age ccc gac agt tta gaa aac ata cct 288 
Leu Met Cys Phe Ser. He Asp Ser Pro Asp Ser Leu Glu Asn He Pro 

85 90 95 

gag aaa tgg acc ccg gag gtg aaa cat ttc tgc ccc aat gtg ccc att 336 
Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro- He 

100 105 HO 

att tta gtt gga aac aag aag gat ctg cgt aat gat gaa cac act cgt 384 
He Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 120 125 

agg gag etc acc aaa atg aaa cag gag cct gtg aag ccc gaa gaa ggt 432 
Arg Glu Leu Thr Lys Met Lys Gin Glu Pro Val Lys Pro Glu Glu Gly 



240 
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135 140 
gca aac cgt ate tec gcc tac gcg tac atg gaa tgc tct 480 
Ala Asn Arg lie Ser Ala Tyr Ala Tyr Met Glu Cys Ser 
150 155 160 

aaa gat ggc gtg agg gaa gtg ttt gag ctg get aca egg 528 
Lys Asp Gly Val Arg Glu Val Phe Glu Leu Ala Thr Arg 

165 170 175 

caa get agg cgt ggc aag aag aaa cca cgt tgc ctt etc 576 





130 




cgt 
Arg 
145 


gac 
Asp 


atg 
Met 


gca 
Ala 


aaa 
Lys 


acg 
Thr 


gca 
Ala 


gcc 
Ala 


ctg 
Leu 


ate 
lie 


taa 





180 185 190 



582 



<210> 302 
<211> 193 

<212> PRT " — . • * " • 

<213> Xenopus laevis 

<400> 302 

Met Ala Ala lie Arg Lys Lys Leu Val lie Val Gly Asp Gly Ala Cys 

1 5 10 15 

Gly Lys Thr Cys Leu Leu lie Val Phe Ser Lys Asp Gin Phe Pro Glu 

20 25 " 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp lie Glu Val 

35 40 45 

Asp Gly Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 ~ 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val lie 
65 70 75 * 80 

Leu Met' Cys Phe Ser lie Asp Ser Pro Asp Ser Leu Glu Asn Xle Pro 

85 .90 95 

Glu Lys Trp Thr Pro Glu Val Lys His Phe Cys Pro Asn Val Pro lie 

100 105 110 

lie Leu Val Gly Asn Lys Lys Asp Leu Arg Asn Asp Glu His Thr Arg 

115 120 ~ 125 

Arg Glu Leu Thr Lys Met Lys Gin Glu Pro Val Lys Pro Glu Glu Gly 

130 135 140 

Arg Asp Met Ala Asn Arg lie Ser Ala Tyr Ala Tyr Met Glu Cys Ser 
145 150 155 160 

Ala Lys Thr Lys Asp Gly Val Arg Glu Val Phe Glu Leu Ala Thr Arg 

165 170 175 

Ala Ala Leu Gin Ala Arg Arg Gly Lys Lys Lys Pro Arg Cys Leu Leu 
180 185 190 

He 

<210> 303 
<211> 591 
<212> DNA 

<213> Physcomitrella patens 

<220> 

<221> CDS 

<222> (1) . . (591) 

<400> 303 

atg age act tea egg ttt ate aag tgc gtg act gtt gga gat gga get 48 

Met Ser Thr Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

gtc ggg aag acg tgc atg ctt att tea tac acc age aac aca ttt cct 96 

Val Gly Lys Thr Cys Met Leu - He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

act gat tac gtt cct acc gtg ttt gac aac ttc age gca aat gta gtg 144 
Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

gtc gat gga aat acc gtc aac etc ggg tta tgg gat aca gca ggt caa 192 
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240 



Val Asp Gly Asn Tlir Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

gaa gat tac aac agg ctt cgt cct ctg agt tac agg ggt get gat gtt 
Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 ~ * 70 75 80 

ttt etc ctg gcg ttc tec etc ate age aag get agt tat gaa aac ata 288 
Phe Leu Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn He 

.85 30 95 

tea aag aag tgg ate ccg gaa ctg aga cat. tac gcg cca tct gtg cca 33 6 

Ser Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Ser Val Pro 

100 105 110 

ate att etc gtc gga aca aaa ctt gat ctt cgc gat gac aaa caa ttc 384 
He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

ttt get gat cat cct gga gcg get cca ata act act tct caa ggg gag 432 
Phe Ala Asp His Pro Gly Ala Ala Pro He Thr Thr Ser Gin Gly Glu 

130 135 140 

gag etc" agg aag teg att ggg gcg gee teg tac ata gag tgc age tea 480 

Glu Leu Arg Lys Ser He Gly Ala Ala Ser Tyr He Glu Cys Ser Ser 

145 150 155 160 

aag act cag cag aat gta aaa gca gtt ttt gac gca gca ate aag gtg 528 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

gtt etc caa cca cca aag cag aag aag aag aag aaa aaa caa aag aat 576 
Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Lys Lys Gin Lys Asn 

180 185 190 

tgc gtc att ctg tga 59 1 

Cys Val He Leu 
195 

<210> 304 
<211> 196 
<212> PRT 

<213> Physcomitrella patens 
<400> 304 

Met Ser Thr Ser Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala 

15 10 15 

Val Gly Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asp Gly Asn Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 ^ 70 75 80 

Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu Asn He 

85 90 95 

Ser Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Ser Val Pro 

100 105 HO 

He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 A ~ 120 125 

Phe Ala Asp His Pro Gly Ala Ala Pro He Thr Thr Ser Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Ser He Gly Ala Ala Ser Tyr He Glu Cys Ser Ser 
145 ~ * 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Gin Lys Lys Lys Lys Lys Lys Gin Lys Asn 

180 185 190 

Cys Val He Leu 
195 



<210> 305 
<211> 841 
<212> DNA 

<213> Schistosoma mansoni 
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<220> 

<221> CDS 

<222> (7) . • (588) 

<400> 305 

gcaaca atg gcg agt gcg gta cgt aag aaa ctt gtt att gtt gga gat 48 
Met Ala Ser Ala Val Arg Lys Lys Leu Val He Val Gly Asp 
1 5 .10 



ggt 


gca 


tgc 


ggg 


aaa 


act 


tgt 


tta 


ctt 


att 


gta 


ttc 


acre 


aaa 


aac 


caa 


96 


Gly 


Ala 


Cys 


Gly 


Lys 


Tnr 


Cys 


Leu 


Leu 


He 


Val 


Phe 


Ser 


Lys 


ASP 


Gin 




15 










20 










25 








30 




ttt 


ccc 


gaa 


gtt 


tac 


gtt 


cct 


act 


gta 


ttc 


gaa 


aac 


tat 


gtt 


aca 


aat 


144 


Phe 


Pro 


Glu 


Val 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 


Val 


Ala 


Asp 












35 










40 








45 




ate 


gaa 


gtt 


gat 


aac 


aga 


caa 


gtt 

^ mm mm 


gaa 


tta 


get 


etc 


torcr 

"33 


aac 


act 


act 

3^^ 


192 


He 


Glu 


Val 


Asp 


Asn 


Arg 


Gin 


Val 


Glu 


Leu 


Ala 


Leu 


Trp 


Asp 


Thr 


Ala 










50 










55 








60 








ggc 


caa 


gaa 


gat 


tat 


aat 


aaa 


eta 


caa 


cca 


ctt 


tct 


tat 


cct 


gat 


acg 


240 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Pro 


Asp 


Thr 






gta 


65 










70 










75 










gat 


gtt 


ctg 


tta 


tgt 


tat 


agt 


att 


gac 


tct 


ccc 


aat 


aat 


ttt 


aca 


288 


Asp 


val 
80 


Val 


Leu 


Leu 


Cys 


Tyr 
85 


Ser 


He 


Asp 


Ser 


Pro 
90 


Asp 


Ser 


Phe 


Ala 




aac 


att 


gag 


gaa 


aaa 


tgg 


tta 


cca 


gaa 


ate 


cgt 


cat 


ttt 


tat 


cct 


aat 


336 


Asn 


lie 


Glu 


Glu 


Lys 


Trp 


Leu 


Pro 


Glu 


He 


Arcr 


His 


Phe 


Cvs 


Pro 


ASP 




95 










100 










105 










110 




gtt 


ccc 


att 


gtc 


tta 


gtt 


gga 


aac 


aaa 


aaa 


gac 


tta 


cga 


cat 


gac 


gag 


384 


Val 


Pro 


He 


Val 


Leu 
115 


Val 


Gly 


Asn 


Lys 


Lys 
120 


Asp 


Leu 


Arg 


His 


Asp 
125 


Glu 




get 


aca 


aag 


aat 


gag 


eta 


cac 


aga 


acc 


aaa 


caa 


ctt 


cct 


gtc 


act 


ttt 


432 


Ala 


Thr 


Lys 


Asn 
130 


Glu 


Leu 


His 


Arg 


Thr 
135 


Lys 


Gin 


Leu 


Pro 


Val 
140 


Thr 


Phe 




aac 


gag 


ggt 


aaa 


caa 


gta 


get 


gaa 


aag 


att 


tct 


get 


tat 


gee 


ttc 


ttt 


480 


Asn 


Glu 


Gly 
145 


Lys 


Gin 


Val 


Ala 


Glu 
150 


Lys 


He 


Ser 


Ala 


Tyr 
155 


Ala 


Phe 


Phe 




gag 


tgt 


tea 


get 


aag 


acc 


aag 


gaa 


gga 


gtc 


age 


gat 


gtt 


ttc 


gta 


gca 


528 


GlU 


Cys 
160 


Ser 


Ala 


Lys 


Tnr 


Lys 
165 


Glu 


Gly 


Val 


Ser 


Asp 
170 


Val 


Phe 


val 


Ala 




get 


act 


cga 


gee 


gee 


ttg 


aat 


tea 


gcg 


aag 


aag 


aag 


agg 


agg 


agg 


tgt 


576 


Ala 


Thr 


Arg 


Ala 


Ala 


Leu 


Asn 


Ser 


Ala 


Lys 


Lys 


Lys 


Arg 


Arg 


Arg 


Cys 




175 










180 










185 










190 




gat 


tta 


att 


tgatctccac tgttctctaa attgtctaga tttgattttc 


cggtctccga 


635 



Asp Leu He 

aacatacatt cccctcctaa ttcttcctca ggaccttaaa ttgetactte tateggctge 695 

gtgtttcttt tacaatcata ttttttgttt gecaacaaga caatccattc caeggaaata 755 

tttgccattt atttttgect tggtactaaa atagggcatg cctcatgtaa cttatattta 815 

ttgagaataa attactataa tcagtc 841 



<210> 306 
<211> 193 
<212> PRT 

<213> Schistosoma mansoni 
<400> 306 

Met Ala Ser Ala Val Arg Lys Lys Leu Val He Val Gly Asp Gly Ala 

1 5 10 15 

Cys Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Gin Phe Pro 
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20 25 30 

Glu Val Tyr Val Pro Thr Val Plie Glu Asn Tyr Val Ala Asp lie Glu 

35 40 45 

Val Asp Asn Arg Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val 
65 ~ " ^ 70 75 80 

Val Leu Leu Cys Tyr Ser lie Asp Ser Pro Asp Ser Phe Ala Asn lie 

85 90 95 

Glu Glu Lys Trp Leu Pro Glu lie Arg His" Phe Cys Pro Asp Val Pro 

100 105 110 

lie Val Leu Val Gly Asn Lys Lys Asp Leu Arg His Asp Glu Ala Thr 

115 120 125 

Lys Asn Glu Leu His Arg Thr Lys Gin Leu Pro Val Thr Phe Asn Glu 

130 135 140 

Gly Lys Gin Val Ala Glu Lys He Ser Ala Tyr Ala Phe Phe Glu Cys 
145 150 155 160 

Ser* Ala Lys Thr" Lys Glu" Gly Val Ser Asp Val Phe Val Ala Ala Thr 

165 170 175 

Arg Ala Ala Leu Asn Ser Ala Lys Lys Lys Arg Arg Arg Cys Asp Leu 
180 185 190 

He 

<210> 307 
<211> 576 
<212> DNA 

<213> Gallus gallus 

<220> 

<221> CDS 

<222> (1) . . (576) 

i 

<400> 307 

atg cag acg att aag tgt gta gtt gtg ggt gat ggt get gtt ggt aaa 48 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

x 5 10 15 

acc tgt etc tta att tct tac aca aca aat aaa ttt cca teg gaa tac 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

gta cca acg gtt ttt gat aac tat get gta aca gtg atg att gga gga 144 
Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Gly 

35 40 45 

gag cct tac act eta ggc etc ttt gat act gca ggt cag gaa gat tat 192 
Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 ' 55 60 

gat aga tta cga ccc etc age tat cca cag aca gat gta ttt ctg gtc 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 ** 70 75 80 

tgt ttt tea gtg gta tct cct tct tea ttt gaa aat gtg aaa gaa aag 
Cys Phe Ser Val Val Ser Pro Ser Ser Phe Glu Asn Val Lys Glu Lys 

85 90 95 

tgg gta cct gaa att act cac cat tgt cca aag act cct ttt ctg ctt 336 
Trp Val Pro Glu He Thr His His Cys Pro Lys Thr Pro Phe Leu Leu 

100 105 HO 

gtt ggg acc caa att gat eta aga gat gat ccc tea aca att gaa aaa 384 
Val Gly Thr Gin He Asp Leu Arg Asp Asp Pro Ser Thr He Glu Lys 

115 120 125 

ctt gec aag aac aag cag aag ccc ata act cca gag acg get gaa aaa 432 
Leu Ala Lys Asn Lys Gin Lys Pro He Thr Pro Glu Thr Ala Glu Lys 

130 135 140 

ctg gec egg gac ctg aag get gtt aaa tat gtg gaa tgc tct gcg ctt 480 
Leu Ala Arg Asp Leu Lys Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 ~ ~ 150 155 160 

acg cag aaa ggc eta aag aat gta ttt gat gag gcg ata ttg get gec 528 
Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Leu Ala Ala 

165 170 175 

ctg gag cct ccg gag ccg aag aag act cgc agg tgt gtg ctg eta 573 



240 



288 
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Leu Glu Pro Pro Glu Pro Lys Lys Thr Arg Arg Cys Val Leu Leu 



180 185 190 

tga 



<210> 308 
<211> 191 
<212> PRT 

<213> Gallus gallus 
<400> 308 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

15 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

Val Pro Thr " Val Phe Asp Ash Tyr Ala Val Thr Val Met lie Gly Gly 

35 40 45 

Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

Cys Phe Ser Val Val Ser Pro Ser Ser Phe Glu Asn Val Lys Glu Lys 

85 90 95 

Trp Val Pro Glu lie Thr His His Cys Pro Lys Thr Pro Phe Leu Leu 

100 105 110 

Val Gly Thr Gin He Asp Leu Arg Asp Asp Pro Ser Thr lie Glu Lys 

115 120 125 

Leu Ala Lys Asn Lys Gin Lys Pro He Thr Pro Glu Thr Ala Glu Lys 

130 135 140 

Leu Ala Arg Asp Leu Lys Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 "* 150 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Leu Ala Ala 

165 170 175 

Leu Glu Pro Pro Glu Pro Lys Lys Thr Arg Arg Cys Val Leu Leu 
180 185 190 



576 



<210> 309 
<211> 2011 
<212> DNA 

<213> Candida albicans 

<220> 
<221> CDS 

<222> (540) (1580) 
<400> 309 

tctagatttc cttactgtaa atcttaagga tcttcaatga tctatcccta acttcaatta 60 



tagttcacaa tacgctatga cgagcatcta aactatgtca atatctgaaa aggctaattt 120 



tttgttttgt tttaaagcac tttgtgattt taaggatatt ttccttaaat attattttta 180 



gaaataaata cagggacgac tgattggaaa taataaaaca taaacactgc ccatttcagg 



cacgcaaaaa aacaaaaaaa ccgtgccctt ttctttttca tgttttcctt tcccgatttt 300 



ttctctctct ctttatttct ttctctctag atttttttta ctgatctcac actatcacca 360 



catatacaca ctcacacatt atataatctt caattataac tcatctgggt tctaacagtt 420 
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635 



683 



7.31 



779 



827 



tctttttttt tctaacttgc ttttttttta ataaagtttc tataatttcg atacaccaat 480 

tttgttttca ctatttagtt attgtaacgt tctagtttga aaatccattt ctataagat 539 

atg aca ccc aac ggc agt agg aga cat teg gcg tac atg ggg teg ccc 587 
Met Thr Pro Asn Gly Ser Arg Arg His Ser. Ala Tyr Met Gly Ser Pro 
15 10 15 

aga age cag cat agt tec aca atg gaa aca ggt tac aat cct tac gaa 
Arg Ser Gin His Ser Ser Thr Met Glu Thr Gly Tyr Asn Pro Tyr Glu 

20 25 30 

gca gta cag aag aaa cag gaa tta tac caa aat aac aac ggc aat tea 
Ala Val Gin Lys Lys Gin Glu Leu Tyr Gin Asn Asn Asn Gly Asn Ser 

35 40 45 

cca acc gtc ate att gaa gaa gat cca tac ate cca aat tat aaa gag 
pro Thr Val lie He Glu Glu Asp" Pro Tyr He Pro Asn Tyr Lys Glu 

50 55 60 

ctg tct eta gca aat aag aaa aca aat tat aac atg aaa att gtc gtt 
Leu Ser Leu Ala Asn Lys Lys Thr Asn Tyr Asn Met Lys He Val Val 
65 70 75 80 

gtc ggt gac ggt ggg tgt ggt aag acg tgt tta tta tta gca tac aca 
Val Gly Asp Gly Gly Cys Gly Lys Thr Cys Leu Leu Leu Ala Tyr Thr 

85 90 95 

caa aac aaa ttt cct tea ate tat gtt ccc aca gtt ttt gag aat tat 875 
Gin Asn Lys Phe Pro Ser He Tyr Val Pro Thr Val Phe Glu Asn Tyr 

100 105 HO 

gtg acg gca gta cag teg cct aat ggt aaa acc gtg gaa ttg get etc 923 
Val Thr Ala Val Gin Ser Pro Asn Gly Lys Thr Val Glu Leu Ala Leu 

115 120 125 

tgg gat act get ggc caa gaa gaa tac gat agg tta cga cca eta age 971 
Trp Asp Thr Ala Gly Gin Glu Glu Tyr Asp Arg Leu Arg Pro Leu Ser 

130 135 140 

tat cct gac gtt gat att tta ttg gtg tgt ttt gee gtg gac aat gaa 1019 
Tyr Pro Asp Val Asp He Leu Leu Val Cys Phe Ala Val Asp Asn Glu 
145 150 155 160 

gtt agt ttg gag aat gtc aag gat atg tgg ttc ccc gaa gtg aat cat 1067 
Val Ser Leu Glu Asn Val Lys Asp Met Trp Phe Pro Glu Val Asn His 

165 170 175 

tat tgt cct ggt att ccc att att ttg gtt ggt acc aag agt gat tta 1115 
Tyr Cys Pro Gly He Pro He He Leu Val Gly Thr Lys Ser Asp Leu 

180 185 190 

ctg tct gat atg aat cac gat gca tea ata cga gta gee aaa gaa att 1163 
Leu Ser Asp Met Asn His Asp Ala Ser He Arg Val Ala Lys Glu He 

195 200 205 

ggc gee att gga ttg att ttc aca tea gee aag acc atg ttc aat gta 1211 
Gly Ala He Gly Leu He Phe Thr Ser Ala Lys Thr Met Phe Asn Val 

210 215 220 

egg act gtt ttt aat ttt gca tta aac cat ttt caa aga aat atg gaa 1259 
Arg Thr Val Phe Asn Phe Ala Leu Asn His Phe Gin Arg Asn Met Glu 
225 230 235 240 

ttg cag gaa caa tat gaa aag aca ttg ggt tea aga aag aga ata agt 1307 
Leu Gin Glu Gin Tyr Glu Lys Thr Leu Gly Ser Arg Lys Arg He Ser 

245 250 255 

aga gta ttg ggt ggt agt aat gga ggc agt ggg aac cat tec aga cat 
Arg Val Leu Gly Gly Ser Asn Gly Gly Ser Gly Asn His Ser Arg His 

260 265 270 

cat tct aga aac tac tec aac gtt teg aac aac aga agg ggc cat ttg 
His Ser Arg Asn Tyr Ser Asn Val Ser Asn Asn Arg Arg Gly His Leu 

275 280 285 

aag aac aca tea tac gat tec acg gca ttg ttg gat caa ccg ttg aca 1451 
Lys Asn Thr Ser Tyr Asp Ser Thr Ala Leu Leu Asp Gin Pro Leu Thr 

290 295 300 < 

gaa gac acc tat gtg aaa aat cct tat ggg aac ttt ggc tat aaa gca 1499 
Glu Asp Thr Tyr Val Lys Asn Pro Tyr Gly Asn Phe Gly Tyr Lys Ala 
305 310 315 . 320 

aat gtt gaa agt ccg tat aat cag gat gag ttt gca ttt aca aga gaa 



1355 



1403 



1547 
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Asn Val Glu Ser Pro Tyr Asn Gin Asp Glu Phe Ala Phe Thr Arg Glu 

325 ~ 330 335 

aga aag aag aag aaa aag tgt gta ata ttg tagatacctc tttgattagg 1597 
Arg Lys Lys Lys Lys Lys Cys Val lie Leu 

34 0 345 
tccataataa tatattaaag ttctgcatat gacaagaaat cgttttgtag aaagcacatg 1657 



aaatcatgtc acaattgcat ggctagttta caggtctctg gatttcgaat tggatgaaag 1717 
tataattata aaaccaatta gtgtctggga atactacata actgtctact gagatttcct 1777 



atagtgagat atctaactgg tcaaagtgta gtacttttga agtgatattg ggttacttgc 1837 



tijtatatatc gtcaatgttc cgtttatcct tcttatccga gactagtata gcaattatta '. * 1897 



tcattaatcc ttaacaaatg aacaggctcc agcttgtcga aaaaacactg ggcacttcat 1957 



cattttgtag tggtaaatct ttatattggt ttccaatatt atagatcctc taga 2011 



<210> 310 
<211> 346 
<212> PRT 

<213> Candida albicans 



<400> 310 



Met 


Thr 


Pro 


Asn 


Gly 


Ser Arg Arg His Ser Ala Tyr 


Met Gly 


Ser 


Pro 


1 








5 


10 




15 




Arg 


Ser 


Gin 


His 


Ser 


Ser Thr Met Glu Thr Gly Tyr 


Asn Pro 


Tyr 


Glu 






20 




25 


30 






Ala 


Val 


Gin 


Lys 


Lys 


Gin Glu Leu Tyr Gin Asn Asn 


Asn Gly 


Asn 


Ser 






35 






40 


45 






Pro 


Thr 


Val 


He 


He 


Glu Glu Asp Pro Tyr He Pro 


Asn Tyr 


Lys 


Glu 




50 








55 60 








Leu 


Ser 


Leu 


Ala 


Asn 


Lys Lys Thr Asn Tyr Asn Met 


Lys He 


Val 


Val 


65 










70 75 






80 


Val 


Gly 


Asp 


Gly 


Gly 


Cys Gly Lys Thr Cys Leu Leu 


Leu Ala 


Tyr 


Thr 










85 


90 




95 




Gin 


Asn 


Lys 


Phe 


Pro 


Ser lie Tyr Val Pro Thr Val 


Phe Glu 


Asn 


Tyr 








100 




105 


110 






Val 


Thr 


Ala 


Val 


Gin 


Ser Pro Asn Gly Lys Thr Val 


Glu Leu 


Ala 


Leu 






115 






120 


125 






Trp 


Asp 


Thr 


Ala 


Gly 


Gin Glu Glu Tyr Asp Arg Leu 


Arg Pro 


Leu 


Ser 




130 








135 140 








Tyr 


Pro 


Asp 


Val 


Asp 


He Leu Leu Val Cys Phe Ala 


Val Asp 


Asn 


Glu 


145 










150 155 






160 


Val 


Ser 


Leu 


Glu 


Asn 


Val Lys Asp Met Trp Phe Pro 


Glu Val 


Asn 


His 










165 


170 




175 




Tyr 


Cys 


Pro 


Gly 


He 


Pro He lie Leu Val Gly Thr 


Lys Ser 


Asp 


Leu 




180 




185 


190 






Leu 


Ser 


Asp 


Met 


Asn 


His Asp Ala Ser He Arg Val 


Ala Lys 


Glu 


He 






195 






200 


205 






Gly 


Ala 


lie 


Gly 


Leu 


He Phe Thr Ser Ala Lys Thr 


Met Phe 


Asn 


Val 


210 








215 220 








Arg 


Thr 


Val 


Phe 


Asn 


Phe Ala Leu Asn His Phe Gin 


Arg Asn 


Met 


Glu 


225 










230 235 






240 


Leu 


Gin 


Glu 


Gin 


Tyr 


Glu Lys Thr Leu Gly Ser Arg 


Lys Arg 


lie 


Ser 










245 


250 




255 




Arg 


Val 


Leu 


Gly 


Gly 


Ser Asn Gly Gly Ser Gly Asn 


His Ser 


Arg 


His 






260 




265 


270 
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His Ser Arg Asn Tyr Ser Asn Val Ser Asn Asn Arg Arg Gly His Leu 

275 280 285 

Lys Asn Thr Ser Tyr Asp Ser Thr Ala Leu Leu Asp Gin Pro Leu Thr 

290 295 300 

Glu Asp Thr Tyr Val Lys Asn Pro Tyr Gly Asn Phe Gly Tyr Lys Ala 
305 310 315 320 

Asn Val Glu Ser Pro Tyr Asn Gin Asp Glu Phe Ala Phe Thr Arg Glu 

325 330 335 

Arg Lys Lys Lys Lys Lys Cys Val lie Leu 
340 345 

<210> 311 
<211> 710 
<212> DNA 

<213> Beta vulgaris 



<220> 

■<22X> CDS" " * 
<222> (91) . . (684) 

<400> 311 

ttgattgatc aagaggagaa gagagagtga gaggttgttt gaatagaaga tcgttagata 60 



tttttttttt taattttttt tgttaaaaaa atg agt get teg agg ttc ata aaa 114 

Met Ser Ala Ser Arg Phe lie Lys 
1 5 



tgt 


gtt 


aca 


gtt ggt 


gat 


ggt gee gtt 


ggt 


aaa act 


tgc 


ttg 


ttg 


att 




Cys 


Val 


Thr 


Val Gly Asp 


Gly Ala Val 


Gly 


Lys Thr 


Cys 


Leu 


Leu 


He 




10 








15 




20 












tct 


tac 


ace 


age aac 


ace 


ttt cct acg 


gac 


tac gtg 


ccc 


act 


gtt 


ttt 


210 


Ser 


Tyr 


Thr 


Ser Asn 


Thr 


Phe Pro Thr Asp 


Tyr Val 


Pro 


Thr 


Val 


Phe 




25 






30 






35 








40 




gac 


aat 


ttc 


agt gec 


aat 


gtc gtc gtt 


aac 


ggg gee 


aca 


gtt 


aat 


ctg 


258 


Asp 


Asn 


Phe 


Ser Ala 


Asn 


Val Val Val 


Asn 


Gly Ala 


Thr 


Val 


Asn 


Leu 








45 






50 




/ 




55 






gga 


tta 


tgg gat act 


gca 


gga caa gag gat 


tat aac 


aga 


tta 


aga 


cct 


306 


Gly 


Leu 


Trp Asp Thr 


Ala 


Gly Gin Glu Asp 


Tyr Asn 


Arg Leu 


Arg 


Pro 








60 




65 








70 








ttg 


agt 


tat 


cgt gga 


gca 


gat gtt ttt 


att 


etc get 


ttc 


tec 


ctt 


att 


354 


Leu 


Ser 


Tyr Arg Gly Ala 


Asp Val Phe 


He 


Leu Ala 


Phe 


Ser 


Leu 


He 








75 






80 






85 










age 


aag 


get 


agt tat 


gaa 


aat gtt tct 


aag 


aag tgg 


att 


cct 


gag 


ttg 


402 


Ser 


Lys 


Ala 


Ser Tyr Glu 


Asn Val Ser 


Lys 


Lys Trp 


He 


Pro 


Glu 


Leu 






90 








95 




100 












aag 


cat 


tat 


get cct 


ggt 


gtc ccc att 


gtt 


ctt gtt 


gga 


aca 


aag 


etc 


450 


Lys 


His 


Tyr Ala Pro Gly 


Val Pro He 


Val 


Leu Val 


Gly Thr 


Lys 


Leu 




105 








110 






115 








120 




gat 


ctt 


egg 


gat gac 


aag 


cag ttt ttt 


ate 


gac cac 


cct 


ggt 


gca 


gtt 


498 


Asp 


Leu 


Arg Asp Asp 


Lys 


Gin Phe Phe 


He 


Asp His 


Pro 


Gly 


Ala 


Val 








125 






13 0 








135 






cca 


ate 


act 


aca get 


cag 


gga gag gaa 


tta 


agg aaa 


ctg att 


ggg 


get 


546 


Pro 


He 


Thr 


Thr Ala 


Gin 


Gly Glu Glu Leu 


Arg Lys 


Leu 


He 


Gly 


Ala 










140 




145 








150 








cct 


get 


tac 


ate gaa 


tgc 


agt tea aaa 


aca 


cag cag 


aat 


gtc 


aag 


gca 


594 


Pro 


Ala 


Tyr 


He Glu Cys 


Ser Ser Lys 


Thr 


Gin Gin 


Asn Val 


Lys 


Ala 








155 






160 






165 










gtt 


ttt 


gat 


gca gee 


att 


aag gtc gtg 


ctt 


caa cca 


cca 


aag 


aca 


aag 


642 


Val 


Phe 


Asp 


Ala Ala 


He 


Lys Val Val 


Leu 


Gin Pro 


Pro 


Lys 


Thr 


Lys 






170 






175 




180 












aaa 


aag 


aag 


tea aag 


gca 


cag aag get 


tgc 


tec ata 


ttg 


taattgtgta 


691 


Lys 


Lys 


Lys 


Ser Lys 


Ala 


Gin Lys Ala Cys 


Ser He 


Leu 










185 








190 






195 













agttgtaaag agaaacaag 
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<210> 312 
<211> 197 
<212> PRT 

<213> Beta vulgaris 
<400> 312 

Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Leu Leu lie Ser Tyr. Thr Ser Asix Thr Phe Pro 

20 25 30 

Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val 

35 40 45 

Val Asn Gly Ala Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 80 

Phe lie Leu Ala Phe Ser Leu lie Ser Lys Ala Ser Tyr Glu Asn Val 

85 90 95 

Ser Lys Lys Trp lie Pro Glu Leu Lys His Tyr Ala Pro Gly Val Pro 

100 105 110 

He Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Gin Phe 

115 120 125 

Phe He Asp His Pro Gly Ala Val Pro He Thr Thr Ala Gin Gly Glu 

13 0 13 5 140 

Glu Leu Arg Lys Leu He Gly Ala Pro Ala Tyr He Glu Cys Ser Ser 
145 . 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Thr Lys Lys Lys Lys Ser Lys Ala Gin Lys 

180 185 . 190 

Ala Cys Ser He Leu 
195 

<210> 313 
<211> 581 
<212> DNA 

<213> Cavia porcellus 
<400> 313 

atg cag gcc ate aag tgt gtg gtg gtg gga gat gga get gtg ggc aag 48 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

acc tgc etc ctt ate age tac ace ace aac gcc ttc cce gga gag tac 96 

Thr Cys Leu Leu He Ser Tyr- Thr Thr Asn Ala Phe Pro Gly" Glu Tyr 

20 25 30 

ate cce acc gtg ttt gac aac tat tea gcc aat gtg atg gtg gac age 144 
He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ser 

35 40 . 45 

aag cce gtg aac ctg ggg ctg tgg gac aca get ggc cag gag gac tat 192 
Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 ~ 60 

gac cgc ctg cga ccg etc tec tac cca cag acg gac gtc ttc etc ate 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

tgc ttc tec etc gtc agt ccg gcc tec tat gag aat gtc cat gcc aat 288 
Cys Phe Ser Leu Val Ser Pro Ala Ser Tyr Glu Asn Val His Ala Asn 

85 90 95 

tgg tac cce aaa gtg egg cac cac tgc cce age acc cce ate ate ctg 336 
Trp Tyr Pro Lys Val Arg His His Cys Pro Ser Thr Pro He He Leu 

100 105 110 

ttg ggc acc aag ctg gac ctg egg gat gac aag gag acc ate gag aag 384 
Leu Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Glu Thr lie Glu Lys 

115 ' 120 " 125 

ctg aag gag aag aag ctg gcg cce ate acc tac cca cag ggc ctg gca 432 
Leu Lys Glu Lys Lys Leu Ala Pro He Thr Tyr Pro Gin Gly Leu Ala 

130 135 140 

ctg gcc aag gag att gac tct gtg aag tac ctg gag tgc tea gee etc 480 
Leu Ala Lys Glu He Asp Ser Val Lys Tyr Leu Glu Cys Ser Ala Leu 



WO 2005/014828 

145 150 
acc cag aga ggc ctg aag 
Thr Gin Arg Gly Leu Lys 
165 

ctg tgc cct cag ccc acg 
Leu Cys Pro Gin Pro Thr 
180 

ta tag 



234/291 

155 

acc gtc ttt gac gag gcc 
Thr Val Phe Asp Glu Ala 
170 

egg cca cag aag cgc gcc 
Arg Pro Gin Lys Arg Ala 
185 



PCT7EP2004/008136 
160 

ate cgt gcc gtc 528 
lie Arg Ala Val 
175 

tgc age etc etc 576 
Cys Ser Leu Leu 
190 

581 



<210> 314 
<211> 192 
<212> PRT 

<213> Cavia porcellus 

<400> 314 , 

Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

X 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ser 

35 40 45 

Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 ~ 70 75 80 

Cys Phe Ser Leu Val Ser Pro Ala Ser Tyr Glu Asn Val His Ala Asn 

85 90 95 

Trp Tyr Pro Lys Val Arg His His Cys Pro Ser Thr Pro lie He Leu 

100 105 HO 

Leu Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Glu Thr He Glu -Lys 

115 120 125 

Leu Lys Glu Lys Lys Leu Ala Pro He Thr Tyr Pro Gin Gly Leu Ala 

130 * 135 140 

Leu Ala Lys Glu He Asp Ser Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Arg Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

Leu Cys Pro Gin Pro Thr Arg Pro Gin Lys Arg Ala Cys Ser Leu Leu 
180 185 190 

<210> 315 
<211> 982 
<212> DNA 

<213> Lotus japonicus 

<220> 

<221> CDS 

<222> (113) . . (703) 

<400> 315 to _ 

attttgtgtg tctcactctt tttcgcttgt tctattttta gtttcaagtg aaagaaatat 



60 



ggaaagcttt gctgtaaggt aaggaaaatc agttcagaaa aggaggttga aa atg age 118 

Met Ser 
l 

aca get aga ttc ate aag tgt gtt act gtt gga gat gga gca gtg gga 166 
Thr Ala Arg Phe He Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly 

5 10 15 

aag acc tgt atg ctt ate tct tac acc age aac aca ttc cca acg gat 214 
Lys Thr Cys Met Leu He Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp 

20 25 30 

tat gtg cct act gtt ttt gat aac ttc agt gca aat gtg gtg gtt gat 2 62 

Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Val Val Asp 
35 40 45 50 
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ggc age 


aca 


gtt 


aac 


ctg 


gga 


tta 


tgg 


gac 


act 


get 


gga 


cag 


gag 


gat 


310 


Gly Ser 


Thr 


Val 


Asn 
55 


Leu 


Gly 


Leu 


Trp 


Asp 
60 


Thr 


Ala 


Gly 


Gin 


Glu 
65 


Asp 




tac aat 


agg 


ctt 


agg 


cct 


ttg 


age 


tac 


aga 


gga 


gca 


gat 


gtg 


ttc 


ttg 


358 


Tyr Asn 


Arg 


Leu 
70 


Arg 


Pro 


Leu 


Ser 


Tyr 
75 


Arg 


Gly 


Ala 


Asp 


Val 
80 


Phe 


Leu 




ctg get 


ttt 


tec 


etc 


ctt 


age 


aga 


gee 


age 


tat 


gaa 


aat 


ate 


tec 


aaa 


406 


Leu Ala 


Phe 


Ser 


Leu 


Leu 


Ser 


Arg 


Ala 


Ser 


Tyr 


Glu 


Asn 


He 


Ser 


Lys 






85 










90 










95 








aag tgg 


att 


cct 


gaa 


ctg 


aga 


cac 


tat 


gee 


cca 


act 


gtg 


cca 


att 


gtt 


454 


Lys Trp 


lie 


Pro 


Glu 


Leu 


Arg 


His 


Tyr 


Ala 


Pro 


Thr 


Val 


Pro 


He 


Val 




100 










105 










110 












ctt gtg 


gga 


acc 


aaa 


ctt 


gat 


ttg 


agg 


gaa 


gac 


agg 


cag 


tat 


ttg 


att 


502 


Leu Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg 


Glu 


Asp 


Arg 


Gin 


Tyr 


Leu 


He 




115 








120 










125 










130 




gat cat 


cct 


gga 


gee 


aca 


cct 


att 


act 


act 


gee 


cag 


gga 


gaa 


gag 


ctg 


550 


Asp His 


Pro 


Gly 


Ala 


Thr 


Pro 


He 


Thr 


Thr 


Ala 


Gin 


Gly 


Glu 


Glu 


Leu 










135 










140 








145 






aag aag 


gca 


att 


ggt 


get 


get 


gtg 


tac 


eta 


gaa 


tgc 


age 


tea 


aag 


act 


598 


Lys Lys 


Ala 


lie 
150 


Gly 


Ala 


Ala 


Val 


Tyr 
155 


Leu 


Glu 


Cys 


Ser 


Ser 
160 


Lys 


Thr 




caa cag 


aat 


gtg 


aag 


get 


gtg 


ttt 


gat 


get 


get 


ate 


aag 


gtt 


gtt 


ttg 


646 


Gin Gin 


Asn 


Val 


Lys 


Ala 


Val 


Phe 


Asp 


Ala 


Ala 


He 


Lys 


Val val 


Leu 






165 










170 










175 










cag cca 


cct 


aaa 


cca 


aag 


aaa 


aaa 


cga 


aag 


aag 


acc 


aga 


cca 


tgc 


gtt 


694 


Gin Pro 


Pro 


Lys 


Pro 


Lys 


Lys 


Lys 


Arg 


Lys 


Lys 


Thr 


Arg 


Pro 


Cys 


Val 




180 










185 










190 













ttc ctt taattgatgt tcatgtttga ttegcaatet gtagcattcg ggaccttctt 750 

Phe Leu 

195 

cctatgtttt ttacccttta tttgattgea caaatatget ttttgaacat atgtattaca 810 

cactcttatc ttctttccgt tttcttgtat tttttgttgg tgggtctaag tgatatttea 870 

atgtatctac tggaacaaac aaggagactg tactcgatca aagataaatt ttgtgataag 930 

aaataatagc attcatagtg acttttttta ttaaaaaaaa aaaaaaaaaa aa 982 



<210> 316 
<2li> 196 
<212> PRT 

<213> Lotus japonicus 
<400> 316 



Met 


Ser 


Thr 


Ala 


Arg 


Phe 


He 


Lys 


Cys 


Val 


Thr Val 


Gly 


Asp 


Gly Ala 


1 








5 








10 






15 


Val 


Gly 


Lys 


Thr 


Cys 


Met 


Leu 


He 


Ser 


Tyr 


Thr Ser 


Asn 


Thr 


Phe Pro 








20 










25 








30 




Thr 


Asp 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Phe Ser 


Ala 


Asn 


Val Val 






35 










40 








45 






Val 


Asp 


Gly 


Ser 


Thr 


val 


Asn 


Leu 


Gly 


Leu 


Trp Asp 


Thr 


Ala 


Gly Gin 




50 










55 








60 






Glu 


Asp 


Tyr 


Asn 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr Arg 


Gly 


Ala 


Asp Val 


65 










70 










75 






80 


Phe 


Leu 


Leu 


Ala 


Phe 


Ser 


Leu 


Leu 


Ser 


Arg 


Ala Ser 


Tyr 


Glu 


Asn He 










85 










90 








95 


Ser 


Lys 


Lys 


Trp 


He 


Pro 


Glu 


Leu 


Arg 


His 


Tyr Ala 


Pro 


Thr 


Val Pro 








100 










105 








110 




He 


Val 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 


Asp 


Leu 


Arg Glu 


Asp 


Arg 


Gin Tyr 






115 










120 








125 




Leu 


He 


Asp 


His 


Pro 


Gly 


Ala 


Thr 


Pro 


He 


Thr Thr 


Ala 


Gin 


Gly Glu 




13 0 










135 








140 
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Glu Leu Lys Lys Ala He Gly Ala Ala Val Tyr Leu Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn val Lys Ala Val Phe Asp Ala Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Lys Pro Lys Lys Lys Arg Lys Lys Thr Arg Pro 

180 185 190 

Cys Val Phe Leu 
195 

<210> 317 
<211> 659 
<212> DNA 

<213> Dictyostelium discoideum 

<220> 

<221> CDS 

<222> (22) . . (609) 

<400> 317 

ggttaaaaag tttacataat c atg caa tea att aaa ttg gtg gta gta ggt 51 

Met Gin Ser He Lys Leu Val Val Val Gly 
15 10 
gat ggt get gtt ggt aaa act tgt tta tta att tct tat aca tea aat 99 
Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Asn 

15 " 20 25 

tct ttc cca aca gaa tac gtt cca act gta ttt gat aac tat tea gca 147 
Ser Phe Pro Thr Glu Tyr Val Pro Thr Val Phe Asp Asn Tyr Ser Ala 

30 35 40 

aat gtt atg gta gat aat aaa act gtt tea tta ggt ctt tgg gat act 195 
Asn Val Met Val Asp Asn Lys Thr Val Ser Leu Gly Leu Trp Asp Thr 

45 50 55 

get ggt caa gag gat tat gat cgt tta .aga cca ctt tea tac cca caa 243 
Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin 

60 65 70 

acc gat gtt ttt ctt att tgt ttc get att ata agt caa act tea tat 291 
Thr Asp Val Phe Leu He Cys Phe Ala He He Ser Gin Thr Ser Tyr 
75 80 85 90 

aca aat gta aaa tct aaa tgg tgg cct gaa gtc aca cat cat tgt cca 
Thr Asn Val Lys Ser Lys Trp Trp Pro Glu Val Thr His His Cys Pro 

95 100 105 

aac tgc aca att att tta gtt ggt aca aaa tgt gat tta aga gaa gac 387 
Asn Cys Thr He He Leu Val Gly Thr Lys Cys Asp Leu Arg Glu Asp 

110 H5 120 

aaa gaa agt tta gaa aaa etc aga gaa aaa cat caa caa cca etc acc 435 
Lys Glu Ser Leu Glu Lys Leu Arg Glu Lys His Gin Gin Pro Leu Thr 

125 * 130 135 

ttc caa caa' ggt gaa caa atg gca aaa gaa att aaa gee ttt tgt tat 483 
Phe Gin Gin Gly Glu Gin Met Ala Lys Glu He Lys Ala Phe Cys Tyr 

140 ~ 145 150 

atg gaa tgt tec get tta act caa aaa ggt etc aaa caa gtt ttc gac 531 
Met Glu Cys Ser Ala Leu Thr Gin Lys Gly Leu Lys Gin Val Phe Asp 
155 * 160 165 170 

gaa get att aaa get gtt att ttc cca gat aga gat aag gee aca aac 579 
Glu Ala He Lys Ala Val He Phe Pro Asp Arg Asp Lys Ala Thr Asn 

175 180 185 

aaa aag aat tea aaa tgt tea att tta taaaaacata tcaaaatatc 626 
Lys Lys Asn Ser Lys Cys Ser He Leu 
19*0 195 
tctattcttt ataataatca caaccaaaaa aaa 659 



339 



<210> 318 
<211> 195 
<212> PRT 

<213> Dictyostelium discoideum 
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<400> 318 



Met 


Gin 


Ser 


He 


Lys 


Leu 


Val 


Val Val Gly Asp Gly Ala 


Val Gly Lys 


1 








5 






10 


15 


Thr 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Thr Ser Asn Ser Phe Pro 


Thr Glu Tyr 








20 








25 


30 


Val 


Pro 


Thr 


Val 


Phe 


Asp 


Asn 


Tyr Ser Ala Asn Val Met 


Val Asp Asn 






35 










40 • 45 


Lys 


Thr 


Val 


Ser 


Leu 


Gly 


Leu 


Trp Asp Thr Ala Gly Gin 


Glu Asp Tyr 




50 










55 


60 


Asp Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr Pro Gin Thr Asp Val 


Phe Leu He 


65 










70 




75 


80 


Cys 


Phe 


Ala 


He 


He 


Ser 


Gin 


Thr Ser Tyr Thr Asn Val 


Lys Ser Lys 










85 






90 


95 


Trp 


Trp 


Pro 


Glu 


Val 


Thr 


His 


His Cys Pro Asn Cys Thr 


He He Leu 








100 








105 


110 


Val 


Gly 


Thr 


Lys 


Cys 


Asp 


Leu 


Arg Glu Asp Lys Glu Ser 


Leu Glu Lys 






115 










120 125 


Leu Arg 


Glu 


Lys 


His 


Gin 


Gin 


Pro Leu Thr Phfe Gin Gin 


Gly Glu Glh 




130 










135 


140 


Met 


Ala 


Lys 


Glu 


He 


Lys 


Ala 


Phe Cys Tyr Met Glu Cys 


Ser Ala Leu 


145 










150 




155 


160 


Thr 


Gin 


Lys 


Gly 


Leu 


Lys 


Gin 


Val Phe Asp Glu Ala He 


Lys Ala Val 










165 






170 


175 


He 


Phe 


Pro 


Asp 


Arg 


Asp 


Lys 


Ala Thr Asn Lys Lys Asn 


Ser Lys Cys 








180 








185 " 


190 


Ser 


He 


Leu 















195 

<210> 319 
<211> 634 
<212> DNA 

<213> Dictyostelium discoideum 

<220> 

<221> CDS 

<222> (46) . . (624) 

<400> 319 

gatcatattt aaattataca aaaaacaatt acatatatat aaaga atg tea gca gca 57 

Met Ser Ala Ala 
1 



gaa 


gtt 


att 


aaa 


tta 


gtc 


gtt 


att 


ggt 


gat 


ggt 


get 


gta 


ggt 


aaa 


act 


105 


Glu 


Val 


He 


Lys 


Leu 


val 


Val 


He 


Gly 


Asp 


Gly Ala 


Val 


Gly 


Lys 


Thr 




5 










10 










15 










20 




tgt 


tta 


ttg 


att 


agt 


tat 


gca 


aac 


aat 


cgt 


ttc 


cca 


gaa 


gat 


tat 


att 


153 


Cys 


Leu 


Leu 


He 


Ser 


Tyr 


Ala 


Asn 


Asn 


Arg 


Phe 


Pro 


Glu 


Asp 


Tyr 


lie 












25 










30 








35 






cca 


act 


gta 


ttc 


gat 


aat 


tat 


gtt 


gta 


aat 


ctt 


aca 


gca 


ggt 


gat 


aga 


201 


Pro 


Thr 


val 


Phe 
40 


Asp 


Asn 


Tyr 


Val 


Val 
45 


Asn 


Leu 


Thr 


Ala 


Gly 
50 


Asp 


Arg 




aac 


ata 


gaa 


etc 


gga 


ctt 


tgg 


gat 


act 


gca 


ggt 


caa 


gaa 


gag 


tac 


gat 


249 


Asn 


He 


Glu 


Leu 


Gly 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly Gin 


Glu 


Glu 


Tyr 


Asp 








55 










60 










65 








aaa 


tta 


aga 


cca 


tta 


agt 


tat 


gca 


aat 


gca 


aat 


gta 


ttt 


tta 


att 


tgc 


297 


Lys 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 


Ala 


Asn 


Ala 


Asn 


val 


Phe 


Leu 


He 


Cys 






70 










75 










80 










ttc 


tea 


att 


acc 


aat 


cca 


gtt 


tea 


ttt 


gaa 


aat 


gtt 


tac 


aca 


aaa 


tgg 


345 


Phe 


Ser 


He 


Thr 


Asn 


Pro 


Val 


Ser 


Phe 


Glu 


Asn Val 


Tyr 


Thr 


Lys 


Trp 




85 










90 










95 








100 




tac 


cca 


gaa 


gtt 


atg 


cat 


ttt 


tgc 


cca 


gaa 


gtt 


cca 


caa 


att 


tta 


gtt 


393 


Tyr 


Pro 


Glu 


Val 


Met 
105 


His 


Phe 


Cys 


Pro 


Glu 
110 


Val 


Pro 


Gin 


He 


Leu 
115 


Val 




ggt 


act 


aaa 


tta 


gat 


aca 


cgt 


gac 


gat 


aga 


ggt 


gtt 


tta 


gat 


aaa 


-ctt 


441 


Gly 


Thr 


Lys 


Leu 


Asp 


Thr 


Arg 


Asp 


Asp 


Arg 


Gly Val 


Leu 


Asp 


Lys 


Leu 










120 










125 










130 








caa 


caa 


act 


ggt 


cat 


aaa 


cca 


att 


aca 


acc 


gaa 


caa 


ggt 


aac 


gat 


tta 


489 


Gin 


Gin 


Thr 


Gly 


His 


Lys 


Pro 


He 


Thr 


Thr 


Glu 


Gin 


Gly 


Asn 


Asp 


Leu 





135 140 145 
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gcc aga aga att aaa gcc 
Ala Arg Arg lie Lys Ala 
150 

tea caa aat etc aaa caa 
Ser Gin Asn Leu Lys Gin 
165 170 
ttt ate aaa aaa aag aaa 
Phe He Lys Lys Lys Lys 
185 

tec 



238/291 

att aaa tat atg gaa tgt 

He Lys Tyr Met Glu Cys 

155 160 

gtc ttt gat gaa gcc att 

Val Phe Asp Glu Ala He 
175 

tec aag tgt att gtt atg 

Ser Lys Cys He Val Met 
190 



PCT/EP2004/008136 

tct gcc aaa ace 537 
Ser Ala Lys Thr 

aaa tct gtt ttg 585 
Lys Ser Val Leu 
180 

taacccgctc 631 



634 



<210> 320 
<211> 192 

<212> PRT 

<213> Dictyostelium" discoideum 

<400> 320 

Met Ser Ala Ala Glu Val He Lys Leu Val Val He Gly Asp Gly Ala 

1-5 10 15 

Val Gly Lys Thr Cys Leu Leu He Ser Tyr Ala Asn Asn Arg Phe Pro 

20 25 30 

Glu Asp Tyr He Pro Thr Val Phe Asp Asn Tyr Val Val Asn Leu Thr 

35 40 45 

Ala Gly Asp Arg Asn He Glu Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 " 55 60 

Glu Glu Tyr Asp Lys Leu Arg Pro Leu Ser Tyr Ala Asn Ala Asn Val 
65 70 ^ 75 80 

Phe Leu He Cys Phe Ser He Thr Asn Pro Val Ser Phe Glu Asn Val 

85 90 95 

Tyr Thr Lys -Trp Tyr Pro Glu Val Met His Phe Cys Pro Glu Val Pro 

100 105 110 

Gin He Leu Val Gly Thr Lys Leu Asp Thr Arg Asp Asp Arg Gly Val 

115 120 125 

Leu Asp Lys Leu Gin Gin Thr Gly His Lys Pro He Thr Thr Glu Gin 

130 135 140 

Gly Asn Asp Leu Ala Arg Arg He Lys Ala He Lys Tyr Met Glu Cys 
145 150 155 160 

Ser Ala Lys Thr Ser Gin Asn Leu Lys Gin Val Phe Asp Glu Ala He 

165 170 175 

Lys Ser Val Leu Phe He Lys Lys Lys Lys Ser Lys Cys He Val Met 
180 185 190 

<210> 321 
<211> 765 
<212> DNA 

<213> Dictyostelium discoideum 

<220> 

<221> CDS 

<222> (1) . . (765) 

<400> 321 

atg get aca ggt att aag aaa act gtt aaa gta gta gtt gta ggt gat 48 
Met Ala Thr Gly He Lys Lys Thr Val Lys Val Val Val Val Gly Asp 
1 5 10 15 

ggt get gtt ggt aaa act tea ctt tta att tta tat aca act aaa gca 
Gly Ala Val Gly Lys Thr Ser Leu Leu He Leu Tyr Thr Thr Lys Ala 

20 25 30 

ttc cca aaa gat tat gta cca aca gta ttt gat aac ttt aat tgt tta 144 
Phe Pro Lys Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Asn Cys Leu 

35 ** 40 45 

gag atg tac gat aat aaa cca gtt aat tta gta ttg tgg gat ace get 192 
Glu Met Tyr Asp Asn Lys Pro Val Asn Leu Val Leu Trp Asp Thr Ala 

50 55 60 

ggt caa gaa gat tat gat aat tta cgt cca ttg tea tat cca cag act 240 



96 
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Gly Gin 


Glu 


Asp 


Tyr Asp 


65 








70 


gat gta 


ttt 


ata 


ata 


tgt 


Asp Val 


Phe 


lie 


He 


Cys 








85 




aat ata 


aag 


tat 


aaa 


tgg 


Asn lie 


Lys 


Tyr 


Lys 


Trp 






100 






aca cca 


att 


att 


tta 


gtt 


Thr Pro 


He 


He 


Leu 


Val 




115 








aaa aca 


tta 


tea 


caa 


tta 


Lys Thr 


Leu 


Ser 


Gin 


Leu 


130 










gat gaa 


ggt 


gta 


gca 


tta 


Asp Glu 


Gly 


Val 


Ala' 


Leu 


145 








150 


gaa tgt 


tct 


gca 


ttg 


aca 


Glu Cys 


Ser 


Ala 


Leu 


Thr 








165 




gca att 


aaa 


gca 


get 


ttt 


Ala He 


Lys 


Ala 


Ala 


Phe 






180 






aaa tct 


agt 


ggt 


aaa 


tct 


Lys Ser 


Ser 


Gly 


Lys 


Ser 




195 








aaa act 


act 


act 


aca 


act 


Lys Thr 


Thr 


Thr 


Thr 


Thr 


210 










get get 


tea 


act 


gca 


aaa 


Ala Ala 


Ser 


Thr 


Ala 


Lys 


225 








230 


ttg ttc 


cgt 


aaa 


aaa 


gat 


Leu Phe Arg 


Lys 


Lys 


Asp 








245 





75 80 



105 110 



135 140 

gca aaa gag ata ggt gca gta caa ttt ttc 

Ala Lys Glu He Gly Ala Val Gin Phe Phe 

155 160 



288 



90 95 
tta cct gaa att aat caa act aat cag ggt 336 



384 



120 125 
caa gaa tea aaa caa gaa cca gtt tea aga 432 



480 



528 



170 175 
aat aaa cct get gta act tea cca act teg 576 
Asn Lys Pro Ala Val Thr Ser Pro Thr Ser 

185 190 
tea cct tea tct ace tct tea aaa cct tct 624 
Ser Pro Ser Ser Thr Ser Ser Lys Pro Ser 

200 205 
aca ace tct tct tct tct tct tea cca cct 672 
Thr Thr Ser Ser Ser Ser Ser Ser Pro Pro 
215 220 

cca gca ggt gaa aag aaa tta agt tgg ggt 720 
Pro Ala Gly Glu Lys Lys Leu Ser Trp Gly 
235 '•. 240 
aaa gat gaa aag aaa cca gca aaa taa 765 
Lys Asp Glu Lys Lys Pro Ala Lys 
250 



<210> 322 
<211> 254 
<212> PRT 

<213> Dictyostelium discoideum 
<400> 322 

Met Ala Thr Gly lie Lys Lys Thr Val Lys Val Val Val Val Gly Asp 

1 5 10 15 

Gly Ala Val Gly Lys Thr Ser Leu Leu He Leu Tyr Thr Thr Lys Ala 

20 25 30 

Phe Pro Lys Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Asn Cys Leu 

35 40 45 

Glu Met Tyr Asp Asn Lys Pro Val Asn Leu Val Leu Trp Asp Thr Ala 

50 55 60 

Gly Gin Glu Asp Tyr Asp Asn Leu Arg Pro Leu Ser Tyr Pro Gin Thr 
65 70 75 80 

Asp Val Phe He He Cys Tyr Ser Val Val Lys Arg Asp Ser Leu Asp 

85 90 95 

Asn He Lys Tyr Lys Trp Leu Pro Glu He Asn Gin Thr Asn Gin Gly 

100 105 110 

Thr Pro He He Leu Val Gly Thr Lys Thr Asp Leu Arg Glu Asp Lys 

115 120 125 

Lys Thr Leu Ser Gin Leu Gin Glu Ser Lys Gin Glu Pro Val Ser Arg 

130 135 140 

Asp Glu Gly Val Ala Leu Ala Lys Glu He Gly Ala Val Gin Phe Phe 
145 150 155 160 

Glu Cys Ser Ala Leu Thr Gly Asn Gly Val Asn Asp He Phe Ala Ala 

165 170 175 

Ala He Lys Ala Ala Phe Asn Lys Pro Ala Val Thr Ser Pro Thr Ser 

180 185 190 

Lys Ser Ser Gly Lys Ser Ser Pro Ser Ser Thr Ser Ser Lys Pro Ser 
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195 200 205 

Lys Thr Thr Thr Thr Thr Thr Thr Ser Ser Ser Ser Ser Ser Pro Pro 

210 215 220 

Ala Ala Ser Thr Ala Lys Pro Ala Gly Glu Lys Lys Leu Ser Trp Gly 
225 230 235 240 

Leu Phe Arg Lys Lys Asp Lys Asp Glu Lys Lys Pro Ala Lys 
245 250 

<210> 323 
<211> 672 
<212> DNA 

<213> Dictyostelium discoideum 

<220> 

<221> CDS 

<222> (1) . . (672) 



48 



96 



144 



240 



288 



336 



<400> 323 

atg tea gaa gat caa ggt tea gga gca aca aga gtt aaa tta gta gtt 
Met Ser Glu Asp Gin Gly Ser Gly Ala Thr Arg Val Lys Leu Val Val 
1 5 10 15 

gtc ggt gat ggt get gtt ggt aaa aca tgt ctt tta att tgt tat gca 
Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu lie Cys Tyr Ala 

20 25 30 

caa aat gat ttt cca gta gat tat gta cca act gtt ttt gaa aat tat 
Gin Asn Asp Phe Pro Val Asp Tyr Val Pro Thr Val Phe Glu Asn Tyr 

35 40 45* 

aca gca ace aga aag aga gga aat gaa gat att aaa gta cat tta tgg 192 
Thr Ala Thr Arg Lys Arg Gly Asn Glu Asp lie Lys Val His Leu Trp 

50 55 . .60 

gat act gca ggc caa gaa gaa tat gat cgt tta cgt cca tta tea tac 
Asp Thr Ala Gly Gin Glu Glu Tyr Asp Arg Leu Arg Pro Leu Ser Tyr 
65 ~ 70 75 80 

cca ggc get gat gtt gtt etc ctt tgt ttc age aca ate agt caa tea 
Pro Gly Ala Asp Val Val Leu Leu Cys Phe Ser Thr lie Ser Gin Ser 

85 90 95 

tea tat gaa gec att aga gat aaa tgg gca cca gaa gtt aat cac tat 
Ser Tyr Glu Ala He Arg Asp Lys Trp Ala Pro Glu Val Asn His Tyr 

100 " 105 HO 

ate cca gat gta cca tea att tta gtt ggt act aaa ate gat tta cgt 384 
He Pro Asp Val Pro Ser He Leu Val Gly Thr Lys He Asp Leu Arg 

115 120 125 

gaa caa caa cac cca gat cca aac tct ggt aaa ttc gaa cca ate act 
Glu Gin Gin His Pro Asp Pro Asn Ser Gly Lys Phe Glu Pro He Thr 

130 135 140 

gee gat atg ggt att tea atg caa aaa caa att aaa gee aag aaa tat 
Ala Asp Met Gly He Ser Met Gin Lys Gin He Lys Ala Lys Lys Tyr 
145 * 150 155 160 

tta gaa gtc tct 'gca aag act cgt caa ggt tta gaa gaa gtt ttc agt 
Leu Glu Val Ser Ala Lys Thr Arg Gin Gly Leu Glu Glu Val Phe Ser 

165 170 175 

get gee att gaa ate gtt ctt gaa tea aga ggt atg gat aaa aag agt 576 
Ala Ala He Glu He Val Leu Glu Ser Arg Gly Met Asp Lys Lys Ser 

180 185 190 

caa gat ggt tct tea agt gca tct ggt gtt cca tea ggt gat aaa cca 624 
Gin Asp Gly Ser Ser Ser Ala Ser Gly Val Pro Ser Gly Asp Lys Pro 

195 200 205 

aca aaa gga aaa gca ggt aaa aag aaa tct ggt tgt att ata ctt 
Thr Lys Gly Lys Ala Gly Lys Lys Lys Ser Gly Cys He He Leu 

210 215 220 



432 



480 



528 



669 



taa 



672 



<210> 324 
<211> 223 
<212> PRT 
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<213> Dictyostelium discoideum 
<400> 324 

Met Ser Glu Asp Gin Gly Ser Gly Ala Thr Arg Val Lys Leu Val Val 

1 5 10 15 

Val Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu lie Cys Tyr Ala 

20 2 5 30 

Gin Asn Asp Phe Pro Val Asp Tyr Val Pro Thr Val Phe Glu Asn Tyr 

35 40 45 

Thr Ala Thr Arg Lys Arg Gly Asn Glu Asp lie Lys Val His Leu Trp 

50 55 60 

Asp Thr Ala Gly Gin Glu Glu Tyr Asp Arg Leu Arg Pro Leu Ser Tyr 
65 70 75 80 

Pro Gly Ala Asp Val Val Leu Leu Cys Phe Ser Thr lie Ser Gin Ser 

85 90 95 

Ser Tyr Glu Ala lie Arg Asp Lys Trp Ala Pro Glu Val Asn His Tyr 

100 105 110 

lie Pro Asp Val Pro- Ser He Leu Val Gly Thr Lys lie Asp Leu Arg 

115 120 125 

Glu Gin Gin His Pro Asp Pro Asn Ser Gly Lys Phe Glu Pro He Thr 

130 135 140 

Ala Asp Met Gly He Ser Met Gin Lys Gin He Lys Ala Lys Lys Tyr 
145 150 155 160 

Leu Glu Val Ser Ala Lys Thr Arg Gin Gly Leu Glu Giu Val Phe Ser 

165 170 175 

Ala Ala lie Glu He Val Leu Glu Ser Arg Gly Met Asp Lys Lys Ser 

180 185 190 

Gin Asp Gly Ser Ser Ser Ala Ser Gly Val Pro Ser Gly Asp Lys Pro 

195 200 205 

Thr Lys Gly Lys Ala Gly Lys Lys Lys Ser Gly Cys He He Leu 
210 215 220 

<210> 325 
<211> 579 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(579) 

<400> 325 

atg cag gcc ate aag ggt gtg gtg gtg gga gac gga get gta ggt aaa 48 

Met Gin Ala He Lys Gly Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

act tgc eta ctg ate agt tac aca ace. aac gca ttt cct gga gaa gat 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Asp 

20 25 30 

ate cct act gcc ttt gac aat tat tct gcc aat gtt atg gta gat gga 144 
He Pro Thr Ala Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

aaa ctg gtg aat ctg ggc tta tgg aat aca get gga caa gaa gat tat 192 
Lys Leu Val Asn Leu Gly Leu Trp Asn Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gac aga tta cgc ccc eta tec tat ccg caa gca gat gtg ttc tta att 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Ala Asp Val Phe Leu He 
65 70 75 80 

tgc ttt teg ctt gtg agt cct gca tea ttt gaa aat gtc ctt gca aag 288 
Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val Leu Ala Lys 

85 90 95 

tgg tat cct gag gtg cag cac cac tgt ccc aac act ccc att ate eta 336 
Trp Tyr Pro Glu Val Gin His His Cys Pro Asn Thr. Pro He He Leu 

100 105 110 

gtg gga act aaa ctt gat ctt agg gat gat aaa gac agg ate cag aaa 384 
Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asp Arg He Gin Lys 

115 120 ~ 125 

ctg aag gag aag aag eta act ccc ate acc tat ccg cag ggt eta gcc 432 
Leu Lys Glu Lys Lys Leu. Thr Pro He Thr Tyr Pro Gin Gly Leu Ala 
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130 






atg 


get 


aag 


gag atg ggt 


Met 


Ala 


Lys 


Glu Met Gly 


145 






150 


aca 


agg 


cga 


ggc etc aag 


Thr 


Arg 


Arg 


Gly Leu Lys 








165 


etc 


tgc 


cca 


cct ccc gtg 


Leu 


Cys 


Pro 


Pro Pro Val 








180 


tag 
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135 140 
get gta aaa tac ctg gag 
Ala Val Lys Tyr Leu Glu 
155 

aca gtg ttt gac gaa gcg 
Thr Val Phe Asp Glu Ala 
170 

aag aag agg aag aga aaa 
Lys Lys Arg Lys Arg Lys 
185 
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tgc ttg gec etc 480 
Cys Leu Ala Leu 
. 160 

ate cga get gtc 528 
He Arg Ala Val 
175 

tgc ctg cag ttg 576 
Cys Leu Gin Leu 
190 

579 



<210> 326 

<211> 192 ....... 

"<212> "PRT ' " * 
<213> Homo sapiens 

<400> 326 

Met Gin Ala He Lys Gly Val Val Val Gly Asp Gly Ala Val Gly Lys 

15 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Asp 

20 25 30 

He Pro Thr Ala Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

Lys Leu Val Asn Leu Gly Leu Trp Asn Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Ala Asp Val Phe Leu He 
65 70 75 80 

Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val Leu Ala Lys 

85 90 95 

Trp Tyr Pro Glu Val Gin His His Cys Pro Asn Thr Pro He He Leu 

100 105 HO 

Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asp Arg He Gin Lys 

115 120 125 

Leu Lys Glu Lys Lys Leu Thr Pro He Thr Tyr Pro Gin Gly Leu Ala 

13 0 * "* 135 140 

Met Ala Lys Glu Met Gly Ala Val Lys Tyr Leu Glu Cys Leu Ala Leu 
145 150 155 160 

Thr Arg Arg Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

Leu Cys Pro Pro Pro Val Lys Lys Arg Lys Arg Lys Cys Leu Gin Leu 
180 185 190 

<210> 327 
<211> 594 
<212> DNA 

<213> Dictyostelium discoideum 

<220> 

<221> CDS 

<222> (5) • . (589) 

<400> 327 

cata atg caa gca att aaa tgt gta gtt gtt ggt gat ggt gca gtt ggt 49 
Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly 
1 5 10 15' 

aaa aca tgt ctt tta att tea tat aca ace aat get ttt cca gga gag 
Lys Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu 

20 25 3 0 

tat ate cca aca gtt ttt gat aat tac age gca aat gta atg gtt gat 145 
Tyr He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp 

35 40 45 

ggt aaa cca att aat etc gga tta tgg gat aca gca ggt caa gaa, gat 193 
Gly Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp 
50 55 60 



97 
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tat 


gat 


cgt 


tta 


cgt 


cct 


Tyr 


Asp 


Arg 


Leu Arg 


Pro 




65 










att 


tgt 


ttt 


tea 


ate 


gtt 


lie 


Cys 


Phe 


Ser 


He 


Val 


80 










85 


aaa 


tgg 


cat 


cca 


gaa 


ata 


Lys 


Trp 


His 


Pro 


Glu 


He 










100 




tta 


gtt 


ggt 


act 


aaa 


tta 


Leu 


Val 


Gly 


Thr 


Lys 


Leu 








115 






aga 


tta 


aaa 


gag 


aaa 


aaa 


Arg 


Leu 


Lys 


Glu 


Lys 


Lys 






130 








tea 


aaa 


atg 


aaa 


gaa 


att 


Ser 


Lys 


Met 


Lvs 


Glu 


He 




145 










etc 


aca 


caa 


aaa 


ggt 


ctt 


Leu 


Thr 


Gin 


Lys 


Gly 


Leu 


160 










165 


gta 


att 


aat 


cca 


act 


ctt 


Val 


He 


Asn 


Pro 


Thr 


Leu 










180 




att 


ata 


atg 


taaaaaaa 


He 


He 


Met 
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tta tec tat cct caa act 
Leu Ser Tyr Pro Gin Thr 
70 75 
tea cca gca tct ttt gag 
Ser Pro Ala Ser Phe Glu 
90 

tgt cac cat gca cca aat 
Cys His His Ala Pro Asn 
105 

gat atg aga gaa' gat aga 
Asp Met Arg. Glu Asp Arg 
120 

ctt tat cca gtt tec tat 
Leu Tyr Pro Val Ser Tyr 
135 

aat get gtc aaa tat ctt 
Asn Ala Val Lys Tyr Leu 
150 - ' "155 

aaa act gtt ttt gat gaa 
Lys Thr Val Phe Asp Glu 
170 

aag aaa aaa cca aaa tct 
Lys Lys Lys Pro Lys Ser 
185 
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gat gtt ttc tta 241 
Asp Val Phe Leu 

aat gta aat ggt 289 
Asn Val Asn Gly 
95 

gtt cga ate att 337 
Val Arg He He 
110 

gat act caa gat 385 
Asp Thr Gin Asp 
125 

gaa caa ggt ctt 433 

Glu Gin Gly Leu 

140 

gaa tgt tea get 481 
Glu Cys Ser Ala 

gca att aga tct 529 
Ala He Arg Ser 
175 

tea aaa ggt tgt 577 
Ser Lys Gly Cys 
190 

594 



<210> 328 
<211> 194 
<212> PRT 

<213> Dictyostelium discoideum 
<400> 328 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

Cys Phe Ser He Val Ser Pro Ala Ser Phe Glu Asn Val Asn Gly Lys 

85 90 95 

Trp His Pro Glu He Cys His His Ala Pro Asn Val Arg He He Leu 

100 105 110 

Val Gly Thr Lys Leu Asp Met Arg Glu Asp Arg Asp Thr Gin Asp Arg 

115 120 125 

Leu Lys Glu Lys Lys Leu Tyr Pro Val Ser Tyr Glu Gin Gly Leu Ser 

130 135 140 

Lys Met Lys Glu He Asn Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ser Val 

165 170 175 

He Asn Pro Thr Leu Lys Lys Lys Pro Lys Ser Ser Lys Gly Cys He 
180 185 190 

He Met 



<210> 329 
<211> 591 
<212> DNA 

<213> Dictyostelium discoideum 



<220> 
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49 



<221> CDS 
<222> (8) . . (589) 

<400> 329 

aaaaaaa atg caa gca att aaa tgt gta gtt gta ggt gat ggt gcg gtt 
Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val 
1 5 10 

ggt aaa aca tgt ctt tta att tct tat aca act aac get ttt cct gga 97 
Glv Lys Thr Cys Leu Leu lie Ser Tyr Thr. Thr Asn Ala Phe Pro Gly 
15 20 25 30 

gag tat ate cca aca gtt ttc gat aat tac tea get aat gtt atg gtt 145 
Glu Tyr lie Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val 

35 40 45 

gat ggt aaa cca att aat ctt ggc ttg tgg gat act get ggt caa gaa 193 
Asp Gly Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu 

• 50 55 60 

gat tat gat cgt tta cgt cca ctt tea tat . cct . caa act...gat gtt ttc 241 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe 

65 70 75 

tta att tgc ttt tea att att tct cca tea tea tat gaa aat gtt tea 289 
Leu He Cys Phe Ser He He Ser Pro Ser Ser Tyr Glu Asn Val Ser 

80 85 90 

ggt aaa tgg gga cca gaa gta ttt cat cat get cca aat gtt cca ate 337 
Gly Lys Trp Gly Pro Glu Val Phe His His Ala Pro Asn Val Pro He 
95 100 105 HO 

att ttg gtt ggt aca aaa atg gat atg aga gaa gat aag gaa act caa 385 
He Leu Val Gly Thr Lys Met Asp Met Arg Glu Asp Lys Glu Thr Gin 

115 120 125 

gat aga tta aaa gaa aag aaa ctt tat cca gtt tec tat gaa caa ggt 433 
Asp Arg Leu Lys Glu Lys Lys Leu Tyr Pro Val Ser Tyr Glu Gin Gly 

130 135 140 

ctt tta aaa atg aaa gaa att aat get ttc aaa tat ctt gaa tgc tct 481 
Leu Leu Lys Met Lys Glu He Asn Ala Phe Lys Tyr Leu Glu Cys Ser 

145 150 155 

get etc act caa aaa ggt ctt aaa act gtt ttc g'ac gaa get att aga 529 
Ala Leu Thr Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg 

160 ^ 165 170 

tct gta att aat cca cca gtt aaa aaa tea aaa agt aaa agt gga tgt 577 
Ser Val He Asn Pro Pro Val Lys Lys Ser Lys Ser Lys Ser Gly Cys 
175 180 185 190 

aat ate ttg taaaa 
Asn He Leu 



591 



<210> 330 
<211> 193 
<212> PRT 

<213> Dictyostelium discoideum 

<400> 330 „ , m 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
1 5 10 I 5 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

Cys Phe Ser He He Ser Pro Ser Ser Tyr Glu Asn Val Ser Gly Lys 

85 90 95 

Trp Gly Pro Glu Val Phe His His Ala Pro Asn Val Pro He He Leu 

100 105 HO 

Val Gly Thr Lys Met Asp Met Arg Glu Asp Lys Glu Thr Gin Asp Arg 

115 120 125 

Leu Lys Glu Lys Lys Leu Tyr Pro Val Ser Tyr Glu Gin Gly Leu Leu 
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130 135 140 

Lys Met Lys Glu He Asn Ala Phe Lys Tyr Leu Glu Cys Ser Ala lieu 
145 150 A ~ 155 "* 160 

Thr Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala lie Arg ser Val 

165 170 175 

He Asn Pro Pro Val Lys Lys Ser Lys Ser Lys Ser Gly Cys Asn He 
180 185 190 

Leu 



<210> 331 
<211> 723 
<212> DNA 

<213> Entamoeba histolytica 

<220> 
<221> CDS 

<222> (22) . . (621} " 



<400> 331 



aaataaaaaa < 


aaacatgacc t 


atg 


aga cca 


gtg 


aaa 


ctt gtc 


ate gtc 


yy *- 


3 Jm 












Met 

1 . 


Arg Pro 


Val 


Lys 
5 


Leu Val 


He Val 


Glv 




crat 


crcrt ctcc 

33 u y*— *■* 


gtc 


crcrt 


aaa 


act 


tgt atg 


tta 


att 


tea tat 


cu \— cl auci 


10 

CLCL l» 


QQ 


Asp 


Gly Ala 


Val 


Glv 


Lys 


Thr Cys Met 


Leu 


He 


Ser Tyr 


Thr Thr 


Asn 










15 








20 




25 






act 


ttc cca 


aat 


gaa 


tat 


att 


cca aca 


gtc 


ttt 


gaa aat 


tat aat 


tct 


1 4.7 


Ala 


Phe Pro 


Asn 


Glu 


Tvr 


He 


Pro Thr 


Val 


Phe 


Glu Asn 


Tyr Asn 


Ser 








30 








35 








40 






tea 


ttcr att 


crtt 


gat 


crat 


gtt 


aaa att' 


aat 


ctt gga tta 


tcr or crat 


act 


195 


Ser 


Leu Val 


Val 


Asd 


ASD 


Val 


Lys He Asn Leu Gly Leu 


Trp Asp 


Thr 






45 










50 






55 








get 


gga caa 


gaa 


gat 


tat 


gat 


aga tta 


aga 


cca 


tta tea 


tat cca 


tea • 




Ala 


Gly Gin 


Glu 


Asp 


Tyr 


Asp Arg Leu Arg 


Pro 


Leu Ser 


Tyr Pro 


Ser 






60 








65 








70 








act 


gat gta 


ttc 


ctt 


gtt 


tgt 


ttc tec 


gtt 


att 


get cca 


get tea 


tat 


291 


Thr 


Asp Val 


Phe 


Leu 


Val 


Cys 


Phe Ser 


Val 


He 


Ala Pro 


Ala Ser 


Tyr 




75 








80 








85 






90 




gaa 


aat gtt 


gaa 


ggt 


aaa 


tgg 


aaa cca 


gaa 


att 


gat caa 


cac tgt 


cca 


339 


Glu 


Asn Val 


Glu 


Gly 


Lys 


Trp 


Lys Pro 


Glu He Asp Gin 


His Cys 


Pro 






gta cca 




95 








100 






105 






aat 


att 


att 


ctt 


gtt 


gga act 


aaa 


att 


gat att 


aga gat 


gat 


387 


Asn 


Val Pro 


lie 


He 


Leu 


Val 


Gly Thr Lys 


He Asp lie 


Arg. Asp 


Asp 








110 








115 








120 




cca 


gaa caa 


gtt 


aaa 


cgt. 


tta 


get gaa 


aag 


aat 


att gtc 


cca att 


caa 


435 


Pro 


Glu Gin 


Val 


Lys 


Arg 


Leu 


Ala Glu 


Lys 


Asn 


He Val 


Pro He 


Gin 






125 










130 






135 








cct 


cct caa 


gga 


gat 


gaa 


tta 


gca aag 


aaa 


att 


ggt get 


gtt aaa 


tat 


483 


Pro 


Pro Gin 


Gly 


Asp 


Glu 


Leu 


Ala Lys 


Lys 


He 


Gly Ala 


Val Lys 


Tyr 






140 








145 








150 




att 


gaa tgt 


tct 


get 


tta 


aca 


caa gee 


aat 


ctt 


aaa. ctt 


gtt ttt 


gaa 


531 


He 


Glu Cys 


Ser 


Ala 


Leu 


Thr 


Gin Ala 


Asn 


Leu 


Lys Leu 


Val Phe 


Glu 




155 








160 








165 






170 




gaa 


get gtt 


aga 


get 


gtt 


ctt 


get aaa 


get 


get 


aaa gag 


cca act 


gga 


579 


Glu 


Ala Val 


Arg 


Ala 


Val 


Leu Ala Lys Ala Ala Lys Glu 


Pro Thr 


Gly 










175 








180 






185 




aag 


aaa gaa 


aaa 


gga 


ggt 


aag 


aaa gga 


tgc 


tea 


tta ttc 


taatttcact 


62 8 


Lys 


Lys Glu 


Lys 


Gly 


Gly 


Lys 


Lys Gly Cys 


Ser 


Leu Phe 












190 








195 















ttttagttga taaattgaag aagttgttat aaaagaaaat atatttgttt tttgtaaaaa 688 
aatagaaaat aaaagaagaa aaagattgat ttaaa 723 



<210> 332 



WO 2005/014828 



246/291 



PCT/EP2004/008136 



<211> 199 
<212> PRT 

<213> Entamoeba histolytica 



<400> 332 

Met Arg Pro Val Lys Leu Val lie Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Met Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Asn Glu Tyr 

20 25 30 

lie Pro Thr Val Phe Glu Asn Tyr Asn Ser Ser Leu Val Val Asp Asp 

35 40 45 

Val Lys lie Asn lieu Gly Leu Tip Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Ser Thr Asp Val Phe Leu Val 
65 70 75 80 

Cys Phe Ser Val lie Ala Pro Ala Ser Tyr Glu Asn Val Glu Gly Lys 

85 90 95 

Trp Lys" Pro Glu" 'lie Asp Gin His Cys Pro Asn Val Pro lie lie Leu 

100 105 110 

Val Gly Thr Lys He Asp He Arg Asp Asp Pro Glu Gin Val Lys Arg 

115 120 125 

Leu Ala Glu Lys Asn He Val Pro He Gin Pro Pro Gin Gly Asp Glu 

130 135 140 

Leu Ala Lys Lys He Gly Ala Val Lys Tyr He Glu Cys Ser Ala Leu 
145 ~ 150 155 160 

Thr Gin Ala Asn Leu Lys Leu Val Phe Glu Glu Ala Val Arg Ala Val 

165 170 175 

Leu Ala Lys Ala Ala Lys Glu Pro Thr Gly Lys Lys Glu Lys Gly Gly 

180 185 190 

Lys Lys Gly Cys Ser Leu Phe 
195 

<210> 333 
<211> 597 
<212> DNA 

<213> Candida albicans 



<220> 

<221> CDS 

<222> (1) . . (597) 



<400> 333 



atg 


gtt 


aac 


ggt 


cca 


get 


Met 


Val 


Asn Gly 


Pro 


Ala 


1 








5 




gat 


ggt 


get 


tgt 


ggt 


aag 


Asp 


Gly 


Ala Cys 


Gly 


Lys 








20 






act 


ttc 


cca 


gaa 


gtt 


tat 


Thr 


Phe 


Pro 


Glu 


Val 


Tyr 






35 








gat 


gtt 


gaa 


gtt 


gat 


ggt 


Asp 


Val 


Glu 


Val 


Asp 


Gly 




50 










get 


ggt 


caa 


gaa 


gat 


tat 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


65 










70 


tct 


aat 


gtt 


att 


ttg 


att 


Ser 


Asn 


Val 


He 


Leu 


He 










85 




gat 


aac 


gtt 


tta 


gaa 


aaa 


Asp 


Asn 


Val 


Leu 


Glu 


Lys 








100 






ggt 


gtt 


cca 


ate 


att 


tta 


Gly 


Val 


Pro 


He 


He 


Leu 




115 








cct 


cat 


act 


att 


gaa 


gec 


Pro 


His 


Thr 


He 


Glu 


Ala 



gaa ctt cgt aga aaa tta 
Glu Leu Arg Arg Lys Leu 
10 

act tgt tta tta att gtt 
Thr Cys Leu Leu He Val 
25 

gtc cca aca gtt ttt gaa 
Val Pro Thr val Phe Glu 
40 

aga aaa gtt gaa ttg gca 
Arg Lys Val Glu Leu Ala 
55 60 
gat aga tta aga cca tta 
Asp Arg Leu Arg Pro Leu 
75 

tgt ttt tea gtt gat tea 
Cys Phe Ser Val Asp Ser 
90 

tgg att tct gaa gtt tta 
Trp He Ser Glu Val Leu 
105 

gtt ggt tgt aaa tct gat 
Val Gly Cys Lys Ser Asp 
120 

ttg aga caa caa caa caa 
Leu Arg Gin Gin Gin Gin 



gtc att gtc ggt 48 
Val He val Gly 
15 

ttt tea aaa ggt 96 
Phe Ser Lys Gly 
30 

aat tac gtt get 144 

Asn Tyr Val Ala 

45 

tta tgg gat act 192 
Leu Trp Asp Thr 

tct tat cca gat 240 
Ser Tyr Pro Asp 
80 

cca gat tct tta 288 
Pro Asp Ser Leu 
95 

cat ttc tgt caa 336 
His Phe Cys Gin 
110 

tta aga gat gat 384 

Leu Arg Asp Asp 

125 

caa cca gtc tea 432 
Gin Pro Val Ser 



WO 2005/014828 PCTVEP2004/008136 

247/291 

130 135 140 

act tct gaa ggc caa caa gtt get caa aga att ggt get get gat tac 480 
Thr Ser Glu Gly Gin Gin Val Ala Gin Arg He Gly Ala Ala Asp Tyr 
145 150 155 160 

ttg gaa tgt tct get aaa ace ggt aga ggt gtt aga gaa gtg ttt gaa 528 
Leu Glu Cys Ser Ala Lys Thr Gly Arg Gly Val Arg Glu Val Phe Glu 

165 170 175 

get get act aga get tct tta aga gtt aaa gaa aag aag gaa aag aag 576 
Ala Ala Thr Arg Ala Ser Leu Arg Val Lys Glu Lys Lys Glu Lys Lys 

180 185 190 

aag aaa tgt gtt gtc ttg taa 597 
Lys Lys Cys Val Val Leu 
195 

<210> 334 
<211> 198 
' ~<212> PRT " 
<213> Candida albicans 

<400> 334 

Met Val Asn Gly Pro Ala Glu Leu Arg Arg Lys Leu Val lie Val Gly 

1 5 10 15 

Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly 

20 25 30 

Thr Phe Pro Glu Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala 

35 40 45 

Asp Val Glu Val Asp Gly Arg Lys Val Glu Leu Ala Leu Trp Asp Thr 

50 55 60 ■ 

Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp 
65 70 75 " 80 

Ser Asn Val He Leu He Cys Phe Ser Val. Asp Ser Pro Asp Ser Leu 

85 90 . Sf5 

Asp Asn Val Leu Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin 

100 105 no 

Gly Val Pro He He Leu Val Gly Cys Lys Ser Asp Leu Arg Asp Asp 

115 120 125 

Pro His Thr He Glu Ala Leu Arg Gin Gin Gin Gin Gin Pro Val Ser 

130 135 140 

Thr Ser Glu Gly Gin Gin Val Ala Gin Arg He Gly Ala Ala Asp Tyr 
145 150 155 160 

Leu Glu Cys Ser Ala Lys Thr Gly Arg Gly Val Arg Glu Val Phe Glu 

165 170 " 175 

Ala Ala Thr Arg Ala Ser Leu Arg Val Lys Glu Lys Lys Glu Lys Lys 

180 185 190 

Lys Lys Cys Val Val Leu 
195 

<210> 335 
<211> 1165 
<212> DNA 

<213> Kluyveromyces lactis 

<220> 

<221> CDS 

<222> (102) . . (728) 

<400> 335 

aagtgtcata aaacagaatt actcaagagt tttcgaacag ctttgatttt ataatacaat 60 

aaaaaaaagt agtgatttga tttgttttaa ccctttccat a atg tct caa get gtt 116 

Met Ser Gin Ala Val 
1 5 

ggt aat gtt gee age att aga aga aag eta gtt att gtc ggt gat ggt 164 
Gly Asn Val Ala Ser He Arg Arg Lys Leu Val He Val Gly Asp Gly 

10 15 20 

gcg tgt ggt aag act tgt tta ttg att gtt ttc gec aag ggt aag ttc 212 



260 
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Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe Ala Lys Gly Lys Phe 

25 30 35 

cct cag gtt tac gtg ccc act gtt ttc gat aat tac gtt get gat gtc 
Pro Gin Val Tyr Val Pro Thr Val Phe Asp Asn Tyr Val Ala Asp Val 

40 45 50 

gaa gtt gac ggt cgt cgc gta gaa ttg gcg ttg tgg gat act get ggc 308 
Glu Val Asp Gly Arg Arg Val Glu Leu Ala Leu Trp Asp Thr Ala Gly 

55 " 60 65 

caa gaa gat tat gat aga ttg aga cca ctt teg tac cca gat tec aac 356 
Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser Asn 
70 * 75 80 85 

gtt gtg tta ate tgt tac tea att gat ttg cca gat tec tta gaa aac 404 
Val Val Leu He Cys Tyr Ser He Asp Leu Pro Asp Ser Leu Glu Asn 

90 95 100 

gtg atg gaa aaa tgg ate tct gaa gtt tta cac ttt tgt caa ggt gtt 452 
Val Met Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly Val 

105_ * 110 115 

cca' ate ate ttg gtt ggt tgt aaa gca gat "ttg aga aat gat cct caa 500 
Pro He He Leu Val Gly Cys Lys Ala Asp Leu Arg Asn Asp Pro Gin 

120 125 130 

gtt gtt gaa gaa eta aga get caa gga tta caa cct gtg tct cag get 548 
Val Val Glu Glu Leu Arg Ala Gin Gly Leu Gin Pro Val Ser Gin Ala 

135 140 145 

caa gca caa gaa gtc get gac caa ate ggt get gtt gac tac ate gaa 596 
Gin Ala Gin Glu Val Ala Asp Gin He Gly Ala Val Asp Tyr He Glu 
150 155 160 165 

tgt tct gee aag acc ggt tac ggt gtc aga gaa gtc ttc gag get get 644 
Cys Ser Ala Lys Thr Gly Tyr Gly Val Arg Glu Val Phe Glu Ala Ala 

170 175 180 

aca aga get tct ttg gtc ggt aaa caa ggt aag age aaa cct aag act 692 
Thr Arg Ala Ser Leu Val Gly Lys Gin Gly Lys Ser Lys Pro Lys Thr 

185 190 195 

aag tct age aaa aaa aag aag tgt gtc gtc ttg taaataaaga cacgcacgcc 745 
Lys Ser Ser Lys Lys Lys Lys Cys Val Val Leu 

200 205 
ccaaacccac tcttcacccc cttatctcca acttcatttg aatttacctt gaaacagagg 805 

actgagcttt gaggatatta ttattaggcg atgggaattg tttacaaccg ttaattggca 865 

gaagtaaaga tccaaaaaaa aagaagaggg aagacgaaca aaccatattc atttgtttct 925 



aattgatcaa tttttaatat acaaatgtca gcaattaatt tccgttgaat ttcctaattt 985 

tttctcctcg ttcttttctt tgtttccatc tggtttcaag tgaccatccc tccctgacat 1045 

tttctcttct acctctcttc tgttttgtgt gtttctacct ataaattatt tttatatacc 1105 

tacactcttt tattttgtaa gaaacttaaa tattatcacc actatactgc actttctcat 1165 



<210> 336 
<211> 208 
<212> PRT 

<213> Kluyveromyces lactis 
<400> 336 

Met Ser Gin Ala Val Gly Asn Val Ala Ser He Arg Arg Lys Leu Val 
15 10 15 

He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He Val Phe 
20 25 30 
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Ala 


Lys 


Gly 


Lys 


Phe 


Pro 


Gin 


Val Tyr 


Val Pro Thr 


Val 


Phe 


Asp 


Asn 






35 










40 




45 






Tyr 


Val 
50 


Ala 


Asp 


Val 


Glu 


Val 
55 


Asp Gly 


Arg Arg Val 
60 


Glu 


Leu 


Ala 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp Tyr 


Asp Arg Leu 


Arg 


Pro 


Leu 


Ser 


65 










70 






75 








80 


Tyr 


Pro 


Asp 


Ser 


Asn 


Val 


Val 


Leu lie 


Cys Tyr Ser 


He 


Asp 


Leu 


Pro 










85 








90 




95 




Asp 


Ser 


Leu 


Glu 
100 


Asn 


Val 


Met 


Glu Lys 
105 


Trp. He Ser 


Glu 


Val 
110 


Leu 


His 


Plie 


Cys 


Gin 


Gly 


Val 


Pro 


He 


He Leu 


Val Gly Cys 


Lys 


Ala 


Asp 


Leu 






115 










120 




125 






Arg 


Asn 


Asp 


Pro 


Gin 


Val 


Val 


Glu Glu 


Leu Arg Ala 


Gin 


Gly 


Leu 


Gin 




130 










135 




140 








Pro 


Val 


Ser 


Gin 


Ala 


Gin 


Ala 


Gin Glu 


Val Ala Asp 


Gin 


He 


Gly 


Ala 


145 










150 












160 


Val 


Asp 


Tyr 


lie 


Glu 
165 


Cys 


Ser 


Ala Lys 


Thr Gly Tyr 
170 


Gly 


Val 


Arg 
175 


Glu 


Val 


Phe 


Glu 


Ala 


Ala 


Thr 


Arg 


Ala Ser 


Leu Val Gly 


Lys 


Gin 


Gly 


Lys 








180 








185 






190 


Ser 


Lys 


Pro 
195 


Lys 


Thr 


Lys 


Ser 


Ser Lys 
200 


Lys Lys Lys 


Cys 
205 


Val 


Val 


Leu 



<210> 337 
<211> 1181 
<212> DNA 

<213> Schizophyllum commune 

<220> 

<221> CDS 

<222> (139) . . (759) 

<400> 337 

ctggtctctc gtcctttctt ccaccacgcg cacgcgcacc cacgtttctt ctcccactcc 60 



cggcgctgtc ggccccgtcg ccctcgcgtc cttcctcttc ctcttccgcc agctcagtga 12 0 



caggtacaca ctcccctt 


atg 


gcg 


etc 


tgc 


ggt 


teg teg 


aaa 


ggg 


cgc 


ggc 


171 












Met 
1 


Ala 


Leu 


Cys 


Gly 
5 


Ser Ser 


Lys 


Gly 


Arg 
10 


Gly 




agg 


ggc 


cga 


ccg 


ate 


cag 


cgc 


aag 


gtc 


gtc 


gta tgc 


ggc 


gac 


ggc 


gcg 


219 


Arg 


Gly 


Arg 


Pro 


He 


Gin 


Arg 


Lys 


Val 


Val 


Val Cys 


Gly 


Asp 


Gly 


Ala 










15 










20 






25 








tgc 


ggg 


aag 


acg 


agt 


ctg 


ctg 


aac 


gtc 


ttc 


acg agg 


ggg 


ttc 


ttc 


acg 


267 


Cys 


Gly 


Lys 


Thr 


Ser 


Leu 


Leu 


Asn 


Val 


Phe 


Thr Arg 


Gly 


Phe 


Phe 


Thr 








30 










35 








40 










cag 


gtt 


tat 


gaa 


ccg 


acg 


gtg 


ttc 


gag 


aac 


tac gtg 


cac 


gat 


ctg 


tat 


315 


Gin 


Val 


Tyr 


Glu 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr Val 


His 


Asp 


Leu 


Tyr 






45 










50 








55 










ate 


gac 


gac 


cag 


ctg 


gtg 


gag 


ctg 


agt 


etc 


tgg gat 


acg 


gcg 


ggg 


cag 


363 


He 


Asp 


Asp 


Gin 


Leu 


Val 


Glu 


Leu 


Ser 


Leu 


Trp Asp 


Thr 


Ala 


Gly 


Gin 




60 










65 










70 






75 




gag 


gag 


ttc 


gac 


egg 


eta 


egg 


age 


ctg 


teg 


tat gca 


gaa 


acg 


cat 


gtg 


411 


Glu 


Glu 


Phe 


Asp 


Arg 


Leu 


Arg 


Ser 


Leu 


Ser 


Tyr Ala 


Glu 


Thr 


His 


Val 












80 










85 








90 






ate 


atg 


ata 


tgc 


ttc 


age 


gtc 


gac 


aat 


cca 


acg teg 


etc 


gag 


aat 


gtg 


459 


He 


Met 


He 


Cys 


Phe 


Ser 


Val 


Asp 


Asn 


Pro 


Thr Ser 


Leu 


Glu 


Asn 


Val 










95 










100 








105 








gag 


age 


aag 


tgg 


etc 


gac 


gag 


att 


ttg 


gag 


tac tgt 


ccg 


ggc 


gtg 


aag 


507 


Glu 


Ser 


Lys 


Trp 


Leu 


Asp 


Glu 


He 


Leu 


Glu 


Tyr Cys 


Pro 


Gly 


Val 


Lys 








110 










115 








120 










ttg 


gta 


ttg 


gtc 


gac 


tea 


aaa 


tgt 


gat 


eta 


cgc gac 


gac 


cct 


gca 


gta 


555 


Leu 


Val 


Leu 


Val 


Asp 


Ser 


Lys 


Cys 


Asp 


Leu 


Arg Asp 


Asp 


Pro 


Ala 


Val 






125 










13 0 








135 












etc 


gat 


egg 


eta 


caa 


cga 


tac 


ggt 


aca 


cat 


acg gat 


caa 


tat 


gaa 


gag 


603 


Leu 


Asp 


Arg 


Leu 


Gin 


Arg 


Tyr 


Gly 


Thr 


His 


Thr Asp 


Gin 


Tyr 


Glu 


Glu 
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140 145 150 155 

ggc etc ggg gtc gcg cga aga ata cga get tea cga tac tta gag tgc 651 

Gly Leu Gly Val Ala Arg Arg He Arg Ala Ser Arg Tyr Leu Glu Cys 

160 165 170 

tec tec aaa cac aac egg ggc gtt aac gaa gtc ttc tta cga ggc cgc 699 
Ser Ser Lys His Asn Arg Gly Val Asn Glu Val Phe Leu Arg Gly Arg 

175 180 185 

gcg cgt gtc act gtc cac teg ate agg cag ggg age gec ggg teg tgt 747 
Ala Arg Val Thr Val His Ser lie Arg Gin Gly Ser Ala Gly Ser Cys 

190 195 200 

tgt gtc atg tagcgcacct cactctcgtc agtctagtcc cccaagccga tcgacgctgc 806 
Cys Val Met 
205 

gctggcgctc cccgtcgacc tctcaaactg ctctgcccca ctatgtcaca ccaagcggcc 866 
catcgacccg tcctagacca cactacacag gtgttaggac gcaagagaac tccagcgcgg ,926 . 
cccgccctcg gtccccatgc tcctgggcgt gtagtgcttc gtgcactcct gttcgtaggt 986 
gegggggtae tcctccccac tcgtgcggtg ctcaaggacg tgtagggggc agctcggtgc 1046 
gctgtccgaa taagatagat atataccccc ccaccccgaa ggtcacatac gcatggatat 1106 
cttcacgctc aaatatgtta cgctccctta atetaactge gtcttatatg ggaattgaat 1166 
gcgtttgata tgggc 1181 



<210> 338 
<211> 206 
<212> PRT 

<213> Schizophyllum commune 
<400> 338 

Met Ala Leu Cys Gly Ser Ser Lys Gly Arg Gly Arg Gly Arg Pro He 

15 10 15 

Gin Arg Lys Val Val Val Cys Gly Asp Gly Ala Cys Gly Lys Thr Ser 

20 25 30 

Leu Leu Asn Val Phe Thr Arg Gly Phe Phe Thr Gin Val Tyr Glu Pro 

35 40 45 

Thr Val Phe Glu Asn Tyr Val His Asp Leu Tyr He Asp Asp Gin Leu 

50 55 60 

Val Glu Leu Ser Leu Trp Asp Thr Ala Gly Gin Glu Glu Phe Asp Arg 
65 70 75 80 

Leu Arg Ser Leu Ser Tyr Ala Glu Thr His Val He Met He Cys Phe 

85 90 95 

Ser Val Asp Asn Pro Thr Ser Leu Glu Asn Val Glu Ser Lys Trp Leu 

100 105 110 

Asp Glu He Leu Glu Tyr Cys Pro Gly Val Lys Leu Val Leu Val Asp 

115 120 125 

Ser Lys Cys Asp Leu Arg Asp Asp Pro Ala Val Leu Asp Arg Leu Gin 

130 135 140 

Arg Tyr Gly Thr His Thr Asp Gin Tyr Glu Glu Gly Leu Gly Val Ala 
145 150 155 160 

Arg Arg He Arg Ala Ser Arg Tyr Leu Glu Cys Ser Ser Lys His Asn 

165 " 170 175 

Arg Gly Val Asn Glu Val Phe Leu Arg Gly Arg Ala Arg Val Thr Val 

180 185 190 

His Ser He Arg Gin Gly Ser Ala Gly Ser Cys Cys Val Met 
195 200 205 
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<210> 339 
<211> 618 
<212> DNA 

<213> Schizosaccharomyces pombe 

<220> 

<221> CDS 

<222> (1).. (618) 

<400> 339 

atg tea age tgt ttc gga agt aaa aag aag cct att tat egg aaa ate 48 
Met Ser Ser Cys Phe Gly Ser Lys Lys Lys Pro He Tyr Arg Lys He 
1 5 io - 15 

gta att ctt ggt gat ggt get get ggt aaa acc agt ttg tta aat gta 96 
Val He Leu Gly Asp Gly Ala Ala Gly Lys Thr Ser lieu Leu Asn Val 
20 25 30 

* ttt act aacf ggt" tat' ttc cct cag gta tac gag" cct act ata "ttt gaa 144 
Phe Thr Lys Gly Tyr Phe Pro Gin Val Tyr Glu Pro Thr lie Phe Glu 

35 40 " 45* 

aac tac att cat gat ate ttt gtc gat gga aac agt ata gaa ctg tct 192 
Asn Tyr He His Asp He Phe Val Asp Gly Asn Ser He Glu Leu Ser 

50 55 60 

eta tgg gat aca get ggt caa gaa gag tat gat caa ctg cgt teg tta 240 
Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr Asp Gin Leu Arg Ser Leu 
65 70 75 ~ 80 

tea tat tea gat aca cat gtt att atg ate tgc ttt gee gtg gat tea 288 
Ser Tyr Ser Asp Thr His Val He Met He Cys Phe Ala Val Asp Ser 

85 90 95 

cga gac tea tta gaa aat gta ate aca aaa tgg ctt ccg gaa gtc tct 336 
Arg Asp Ser Leu Glu Asn Val He Thr Lys Trp Leu Pro Glu Val Ser 

100 105 no 

agt aat tgc cct ggt gtt aaa ttg gtt ctt gtt get eta aaa tgt gat 384 
Ser Asn Cys Pro Gly Val Lys Leu Val Leu Val Ala Leu Lys Cys Asp 

115 120 125 

tta cgt gga get gat gag gag caa gtt gat cac agt aaa att att gat 432 
Leu Arg Gly Ala Asp Glu Glu Gin Val Asp His Ser Lys He He Asp 

130 135 140 

tac gag gaa gga ctg gca gcg gca aaa aaa ate aac get gta cga tat 480 
Tyr Glu Glu Gly Leu Ala Ala Ala Lys Lys He Asn Ala Val Arg Tyr 
145 150 155 160 

tta gaa tgc age get aaa tta aat cgt ggc gta aat gaa get ttc acg 528 
Leu Glu Cys Ser Ala Lys Leu Asn Arg Gly Val Asn Glu Ala Phe Thr 

165 170 175 

gaa get gca cgc gtt gee ctt gee gcg caa cca aga ggt aca aag gat 576 
Glu Ala Ala Arg Val Ala Leu Ala Ala Gin Pro Arg Gly Thr Lys Asp 

180 185 " 190 

ggt get gat gaa tec cat ggt acc gga tgt ate att get tga 618 
Gly Ala Asp Glu Ser His Gly Thr Gly Cys He He Ala 
195 200 205 

<210> 340 
<211> 205 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 340 

Met Ser Ser Cys Phe Gly Ser Lys Lys Lys Pro He Tyr Arg Lys He 

1 5 10 15 

Val He Leu Gly Asp Gly Ala Ala Gly Lys Thr Ser Leu Leu Asn Val 

20 25 30 

Phe Thr Lys Gly Tyr Phe Pro Gin Val Tyr Glu Pro Thr He Phe Glu 

35 40 45 

Asn Tyr He His Asp He Phe Val Asp Gly Asn Ser He Glu Leu Ser 

50 55 60 

Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr Asp Gin Leu Arg Ser Leu 
65 70 75 80 
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Ser Tyr Ser Asp Thr His Val He Met He Cys Phe Ala Val Asp Ser 

85 90 95 

Arg Asp Ser Leu Glu Asn Val He Thr Lys Trp Leu Pro Glu Val Ser 

100 105 HO 

Ser Asn Cys Pro Gly Val Lys Leu Val Leu Val Ala Leu Lys Cys Asp 

115 120 125 

Leu Arg Gly Ala Asp Glu Glu Gin Val Asp His Ser Lys He He Asp 

130 135 140 

Tyr Glu Glu Gly Leu Ala Ala Ala Lys Lys. He Asn Ala Val Arg Tyr 
145 150 155 160 

Leu Glu Cys Ser Ala Lys Leu Asn Arg Gly Val Asn Glu Ala Phe Thr 

165 170 175 

Glu Ala Ala Arg Val Ala Leu Ala Ala Gin Pro Arg Gly Thr Lys Asp 

180 185 190 

Gly Ala Asp Glu Ser His Gly Thr Gly Cys He He Ala 
195 200 205 

* <210> 341 
<211> 582 
<212> DNA 

<213> Emericella nidulans 

<220> 

<221> CDS 

<222> (1) . . (582) 

<400> 341 

atg get gag ate cgc cgc aag ctt gtt ate gtt ggt gat ggt gec tgc 48 
Met Ala Glu He Arg Arg Lys Leu Val He Val Gly Asp Gly Ala Cys 
15 10 15 

ggt aag ace tgt ctg ttg ate gtc ttc tea aag ggc act ttc cct gag 96 
Glv Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Thr Phe Pro Glu 

20 =25 30 

gtc tac gtc ccc ace gtc ttt gag aac tac gtt gee gat gtt gag gtt 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp Val Glu Val 

35 40 45 

gat ggc aag cac gtc gag etc get etc tgg gat acg get ggt caa gaa 192 
Asp Gly Lys His Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tac gac cgt etc cgc cct etc tec tac cct gac teg cat gtc ate 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser His Val He 
65 ~ 70 75 80 

ctg att tgc ttc get gtc gac tea ccg gat tec ctt gac aac gtt caa 
Leu He Cys Phe Ala Val Asp Ser Pro Asp Ser Leu Asp Asn Val Gin 

85 90 95 

gag aag tgg ate tct gaa gtc eta cac ttc tgc cag ggt etc ccc ate 336 
Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly Leu Pro He 

100 105 HO 

ate etc gtc gga tgc aag aag gat ctt cgc cat gac ccc aag acg ate 384 
He Leu Val Gly Cys Lys Lys Asp Leu Arg His Asp Pro Lys Thr He 

115 120 125 

gag gag ctg aac aag ace tct cag aag cct gtc ace ccc gaa cag ggt 432 
Glu Glu Leu Asn Lys Thr Ser Gin Lys Pro Val Thr Pro Glu Gin Gly 

130 135 140 

gag gaa gtc cgc aag aag att ggc gee tac aag tac etc gag tgc tct 480 
Glu Glu Val Arg Lys Lys He Gly Ala Tyr Lys Tyr Leu Glu Cys Ser 
145 150 155 160 

get cga ace aac gag ggt gtc cgt gag gtc ttt gag get gee acg cgt 528 
Ala Arg Thr Asn Glu Gly Val Arg Glu Val Phe Glu Ala Ala Thr Arg 

165 170 175 

get gee etc ttg ace aag ace cac aag age aag aag aag tgc age ate 576 
Ala Ala Leu Leu Thr Lys Thr His Lys Ser Lys Lys Lys Cys Ser He 
180 185 190 

ctg taa 
Leu 



240 



288 



582 
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<210> 342 
<211> 193 
<212> PRT 

<213> Emericella nidulans 



<400> 342 



Met 


Ala 


Glu 


He 


Arg Arg Lys Leu Val 


He Val 


Gly Asp Gly Ala 


Cys 


1 








5 




15 




Gly 


Lys 


Thr 


Cys 


Leu Leu He Val Phe 


Ser. Lys 


Gly Thr Phe Pro 


Glu 








20 


25 




30 




Val 


Tyr 


Val 


Pro 


Thr Val Phe Glu Asn 


Tyr Val 


Ala Asp Val Glu 


Val 






35 




40 




45 




Asp 


Gly 


Lys 


His 


Val Glu Leu Ala Leu 


Trp Asp 


Thr Ala Gly Gin 


Glu 




50 






55 




60 




Asp 


Tyr 


Asp 


Arg 


Leu Arg Pro Leu Ser 


Tyr Pro 


Asp Ser His Val 


He 


65 








70 


75 




80 


Leu 


lie 


Cys 


Phe 


Ala Val Asp Ser Pro 


Asp. Ser 


Leu Asp Asn Val 


Gin 










85 


90 


95 




Glu 


Lys 


Trp 


He 


Ser Glu Val Leu His 


Phe Cys 


Gin Gly Leu Pro 


He 






100 


105 




110 




lie 


Leu 


Val 


Gly 


Cys Lys Lys Asp Leu 


Arg His 


Asp Pro Lys Thr 


He 






115 




120 




125 




Glu 


Glu 


Leu 


Asn 


Lys Thr Ser Gin Lys 


Pro Val 


Thr Pro Glu Gin 


Gly 




130 






135 




140 


Glu 


Glu 


Val 


Arg 


Lys Lys He Gly Ala 


Tyr Lys 


Tyr Leu Glu Cys 


Ser 


145 








150 


155 




160 


Ala 


Arg 


Thr 


Asn 


Glu Gly Val Arg Glu 


Val Phe 


Glu Ala Ala Thr 


Arg 










165 


170 


175 


Ala 


Ala 


Leu 


Leu 


Thr Lys Thr His Lys 


Ser Lys 


Lys Lys Cys Ser 


He 








180 


185 




190 





Leu 



<210> 343 

<211> 1086 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (40) . . (672) 

<400> 343 

cgcagccgcc cgcccgcccg ctcagcgccc ggccccggg atg acg gcg gcc cag 54 

Met Thr Ala Ala Gin 
1 5 



gcc 


gcg 


ggt 


gag 


gag 


gcg 


cca 


cca 


ggc 


gtg 


egg 


tec 


gtc 


aag 


gtg 


gtc 


102 


Ala 


Ala 


Gly 


Glu 


Glu 


Ala 


Pro 


Pro 


Gly 


Val 


Arg 


Ser 


Val 


Lys 


Val 


Val 












10 










15 










20 






ctg 


gtg 


ggc 


gac 


ggc 


ggc 


tgc 


ggg 


aag 


acg 


teg 


ctg 


ctg 


atg 


gtc 


ttc 


150 


Leu 


Val 


Gly 


Asp 


Gly 


Gly 


Cys 


Gly 


Lys 


Thr 


Ser 


Leu 


Leu 


Met 


Val 


Phe 










25 










30 










35 








gcc 


gat 


ggg 


gcc 


ttc 


ccc 


gag 


age 


tac 


acc 


ccc 


acg 


gtg 


ttt 


gag 


egg 


198 


Ala 


Asp 


Gly 


Ala 


Phe 


Pro 


Glu 


Ser 


Tyr 


Thr 


Pro 


Thr 


Val 


Phe 


Glu 


Arg 








40 










45 










50 










tac 


atg 


gtc 


aac 


ctg 


caa 


gtg 


aaa 


ggc 


aaa 


cct 


gtg 


cac 


etc 


cac 


ate 


246 


Tyr 


Met 


Val 


Asn 


Leu 


Gin 


Val 


Lys 


Gly 


Lys 


Pro 


Val 


His 


Leu 


His 


He 






55 










60 








65 












tgg 


gac 


aca 


gca 


ggg 


caa 


gat 


gac 


tat 


gac 


cgc 


ctg 


egg 


ccc 


ctg 


ttc 


294 


Trp 


Asp 


Thr 


Ala 


Gly 


Gin 


Asp 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Phe 




70 










75 










80 










85 




tac 


cct 


gac 


gcc 


age 


gtc 


ctg 


ctg 


ctt 


tgc 


ttc 


gat 


gtc 


acc 


age 


ccg 


342 


Tyr 


Pro 


Asp 


Ala 


Ser 


Val 


Leu 


Leu 


Leu 


Cys 


Phe 


Asp 


Val 


Thr 


Ser 


Pro 












90 










95 










100 






aac 


age 


ttt 


gac 


aac 


ate 


ttt 


aac 


egg 


tgg 


tac 


cca 


gaa 


gtg 


aat 


cat 


390 


Asn 


Ser 


Phe 


Asp 


Asn 


He 


Phe 


Asn 


Arg 


Trp 


Tyr 


Pro 


Glu 


Val 


Asn 


His 










105 










110 










115 








ttc 


tgc 


aag 


aag 


gta 


ccc 


ate 


ate 


gtc 


gtg 


ggc 


tgc 


aag 


act 


gac 


ctg 


438 
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Phe 


Cys 


Lys 


Lys 


Val 


Pro 


lie 


lie 


Val 


Val 


Gly Cys 


Lys 


Thr 


Asp 


Leu 






120 










125 








130 










cgc 


aag 


gac 


aaa 


tea 


ctg 


gtg 


aac 


aag 


etc 


cga aga 


aac 


gga 


ttg 


gag 


486 


Arg 


Lys 


Asp 


Lys 


Ser 


Leu 


Val 


Asn 


Lys 


Leu 


Arg Arg 


Asn 


Gly 


Leu 


Glu 




135 










140 








145 












cct 


gtg 


acc 


tac 


cac 


agg 


ggc 


cag 


gag 


atg 


gcg agg 


tec 


gtg 


ggc 


gcg 


534 


Pro 


Val 


Thr 


Tyr 


His 


Arg 


Gly 


Gin 


Glu 


Met 


Ala Arg 


Ser 


val 


Gly 


Ala 




150 








155 










160 








165 




gtg 


gcc 


tac 


etc 


gag 


tgc 


teg 


get 


egg 


etc. 


cat gac 


aac 


gtc 


cac 


gcc 


582 


Val 


Ala 


Tyr 


Leu 


Glu 


Cys 


Ser 


Ala 


Arg 


Leu 


His Asp 


Asn 


Val 


His 


Ala 










170 








175 








180 






gtc 


ttc 


cag 


gag 


gcc 


gcc 


gag 


gtg 


gcc 


etc 


age age 


cgc 


ggt 


cgc 


aac 


630 


Val 


Phe 


Gin 


Glu 
185 


Ala 


Ala 


Glu 


Val 


Ala 
190 


Leu 


Ser Ser 


Arg 


Gly 
195 


Arg 


Asn 




ttc 


tgg 


egg 


egg 


att 


acc 


cag 


ggc 


ttt 


tgc 


gtg gtg 


acc 


tgageggetc 


679 


Phe 


Trp 


Arg 


Arg 


lie 


Thr 


Gin 


Gly 


Phe 


Cys 


Val Val 


Thr 











200. 205 _ . . 21Q 

ggggcgtccc agegacgegg gaaggggcag ggcgctgacc tgetgetgag ctggctgggc 



tggacccggt ccctaggctg tgaccgccga actccactgc aacagaeggg cgccaccaaa 799 



gccaggccct gaggcctggg agtcctggac tgagaaaggg ggttcctggg cccacctgct 859 



ctgtgtaggg ctcgtcctgc ggtgcccgag aatcactcgc taacccctat gcccggtccc 919 



ggaccgacat cctggagccg cctgtgcagc ctgatgcccc ctcgtggctg ctcccagggc 979 



tgcacctgcc aggacctaat gttcttaggt ccctctggcc agaacccaca cccggcccct 1039 



tcccacctgt catactggta actgtaacaa gaaaaacgac atcactt 1086 



<210> 344 
<211> 210 
<212> PRT 

<213> Homo sapiens 



<400> 344 



Met 


Thr 


Ala 


Ala 


Gin Ala Ala Gly Glu Glu Ala Pro 


Pro 


Gly 


Val 


Arg 


1 








5 10 






15 




Ser 


Val 


Lys 


Val 


Val Leu val Gly Asp Gly Gly Cys 


Gly 


Lys 


Thr 


Ser 






20 


25 




30 






Leu 


Leu 


Met 


Val 


Phe Ala Asp Gly Ala Phe Pro Glu 


Ser 


Tyr 


Thr 


Pro 






35 




40 


45 








Thr' 


Val 


Phe 


Glu 


Arg Tyr Met Val Asn Leu Gin Val 


Lys 


Gly 


Lys 


Pro 




50 






55 " 60 










Val 


His 


Leu 


His 


lie Trp Asp Thr Ala Gly Gin Asp 


Asp 


Tyr 


Asp 


Arg 


65 








70 75 








80 


Leu 


Arg 


Pro 


Leu 


Phe Tyr Pro Asp Ala Ser Val Leu 


Leu 


Leu 


Cys 


Phe 








85 90 






95 




Asp 


Val 


Thr 


Ser 


Pro Asn Ser Phe Asp Asn lie Phe 


Asn 


Arg 


Trp 


Tyr 






100 


105 




110 






Pro 


Glu 


Val 


Asn 


His Phe Cys Lys Lys Val Pro lie 


He 


Val 


Val 


Gly 






115 




120 


125 








Cys 


Lys 


Thr 


Asp 


Leu Arg Lys Asp Lys Ser Leu Val 


Asn 


Lys 


Leu 


Arg 


13 0 






135 140 










Arg 


Asn 


Gly 


Leu 


Glu Pro Val Thr Tyr His Arg Gly 


Gin 


Glu 


Met 


Ala 


145 






150 155 








160 


Arg 


Ser 


Val 


Gly 


Ala Val Ala Tyr Leu Glu Cys Ser 


Ala 


Arg 


Leu 


His 








165 170 






175 
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Asp Asn Val His Ala Val Phe Gin Glu Ala Ala Glu Val Ala Leu Ser 

180 185 190 

Ser Arg Gly Arg Asn Phe Trp Arg Arg lie Thr Gin Gly Phe Cys Val 
195 200 205 

Val Thr 
210 

<210> 345 

<211> 1166 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (85) . . (717) 

<400> 345 

gccgccgggc cagccccgcg" cccgccgcag ccagcccgcc gcgtaccgcc tgctgctccg 60 



cgcaccgccg tccgccagcc aggg atg aac gcg tec cag gtt gcg gga gaa 111 

Met Asn Ala Ser Gin Val Ala Gly Glu 
1 5 



gag 


gcg 


ccg 


cag 


aac 


ctctcj 


cac 


teg gtc 


aag gtg gtc 


ctcr 


rrfccj orcrc crac 

3 L 3 33 u 3 au 


159 


Glu Ala 


Pro 


Gin 


Ser 


Gly 


His 


Ser Val 


Lys Val Val 


Leu 


Val Gly Asp 




10 










15 






20 




25 




ggg 


ggc 


tgc 


ggg 


aag 


acg 


tea 


ctg atg 


atg gtc ttc 


QCC 


aaa crcrcr crcc 

^ 333 3 


207 


Gly Gly 


Cys 


Gly 


Lys 


Thr 


Ser 


Leu Met 


Met Val Phe 


Ala 


Lys Gly Ala 












30 








35 




40 




ttc 


cca 


gag 


age 


tac 


aqrt 


ccc 


aca gtg 


ttt gag cgc 


tat 


aat crcc act 


255 


Phe 


Pro 


Glu 


Ser 


Tyr 


Ser 


Pro 


Thr Val 


Phe Glu Arg 


Tyr 


Asn Ala Thr 










45 








50 






55 




ctg cag 


atg aag 


crcrt 


aaa 


cct 


gtg cac 


etc caa ate 


tgg 


gac aca gee 


J U J 


Leu Gin 


Met 


Lys 


Gly 


Lys 


Pro 


Val His 


Leu Gin lie 


Trp 


Asp Thr Ala 








60 










65 




70 




ggg 


caa 


gat 


gac 


tat 


gac 


cgc 


etc egg 


ccc ttg ttc 


tat 


cct gat gee 


351 


Gly Gin 


Asp Asp 


Tyr 


Asp 


Arg 


Leu Arg 


Pro Leu Phe 


Tyr 


Pro Asp Ala 






75 










80 




85 








aat 


gtc 


ttg 


etc 


etc 


tgc 


ttc 


gat gtg 


ace aat cca 


aac 


age ttt gac 


399 


Asn 


Val 


Leu 


Leu 


Leu 


Cys 


Phe Asp Val 


Thr Asn Pro 


Asn 


Ser Phe Asp 




90 










95 






100 




105 




aac 


gtc 


tec 


aac 


egg 


tgg 


tac 


cca gag 


gtg aca, cat 


ttc 


tgc aag gga 


447 


Asn 


Val 


Ser 


Asn 


Arg 


Trp 


Tyr 


Pro Glu 


Val Thr His 


Phe 


Cys Lys Gly 












110 








115 




120 




gtg 


ccc 


ate 


att 


gtt 


gtg 


ggc 


tgc aag 


ata gac ctg 


cgt 


aag gac aag 


495 


val 


Pro 


lie 


He 


Val 


Val 


Gly Cys Lys 


lie Asp Leu 


Arg 


Lys Asp Lys 










125 








130 






135 




gtg 


ctg 


gtg aac 


aac 


ctg 


egg 


aag aaa 


aga ctg gag 


ccc 


gtg ace tac 


543 


Val 


Leu 


Val 


Asn 


Asn 


Leu 


Arg Lys Lys 


Arg Leu Glu 


Pro 


Val Thr Tyr 








140 










145 




150 




cac 


agg 


ggc 


cac 


gat 


atg 


gca 


agg tct 


gtg gga gcg 


gtg 


gec tat ctt 


591 


His 


Arg 


Gly His 


Asp 


Met 


Ala Arg Seri/Val Gly Ala 


Val 


Ala Tyr Leu 






155 










160 




165 








gag 


tgt 


tea 


get 


egg 


etc 


cat 


gac aac 


gtg gaa gca 


gtc 


ttc cag gaa 


639 


Glu 


Cys 


Ser 


Ala 


Arg 


Leu 


His Asp Asn Val Glu Ala 


Val 


Phe Gin Glu 




170 










175 






180 




185 




gca 


gca 


gaa 


gtg 


get 


etc 


age 


agt cgc 


aga cat aac 


ttt 


tgg egg egg 


687 


Ala 


Ala 


Glu 


Val 


Ala 


Leu 


Ser 


Ser Arg Arg His Asn 


Phe 


Trp Arg Arg 












190 








195 




200 




att 


act 


cag 


aat 


tgt 


tgc 


ttg 


gee ace 


tgactggctt ggaacccacc 


734 


lie 


Thr 


Gin Asn 


Cys 


Cys 


Leu 


Ala Thr 
















205 








210 











ttgccaaccg gtttactccg ctgcagaaag acccagaagc agaacttgta ctctgttgac 794 



tgggatggac ctaatcccta ggctgagctg gtagaaccac acatccatcc acagatgggc 



854 
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tccaatgagg cctggcccct ggaaccgaat gaaacccggt aatgatagga aagaagtggt 914 

cccaagaacc cttaagctct ggaaaccaat taatagaaca tcctgtgccc aaatcctgaa 974 

cctgcgcctg aacggggtct gtgcggtctg tttggcgcca accttgtggc taattctaat 1034 

tgaagttata tctacaggac ctaagggttc ccaagaacca acttgcccgg gctaataacc 1094 

tgcgccgtgg ctcaacaacc ttaatttaat cccattgcta agtgtaacaa ctaaaatggc 1154 

aagcacttgc ta 1166 



<210> 346 
<211> 210 
<212> PRT 

<213> Mus musculus 
<400> 346 

Met Asn Ala Ser Gin Val Ala Gly Glu Glu Ala Pro Gin Ser Gly His 

15 10 15 

Ser Val Lys Val Val Leu Val Gly Asp Gly Gly Cys Gly Lys Thr Ser 

20 25 " 30 

Leu Met Met Val Phe Ala Lys Gly Ala Phe Pro Glu Ser Tyr Ser Pro 

35 40 45 

Thr Val Phe Glu Arg Tyr Asn Ala Thr Leu Gin Met Lys Gly Lys Pro 

50 55 60 

Val His Leu Gin lie Trp Asp Thr Ala Gly Gin Asp Asp Tyr Asp Arg 
65 70 75 80 

Leu Arg Pro Leu Phe Tyr Pro Asp Ala Asn Val Leu Leu Leu Cys Phe 

85 90 95 

Asp Val Thr Asn Pro Asn Ser Phe Asp Asn Val Ser Asn Arg Trp Tyr 

100 105 110 

Pro Glu Val Thr His Phe Cys Lys Gly Val Pro lie lie Val Val Gly 

115 120 125 

Cys Lys He Asp Leu Arg Lys Asp Lys Val Leu Val Asn Asn Leu Arg 

130 135 '140 

Lys Lys Arg Leu- Glu Pro Val Thr Tyr His Arg Gly His Asp Met Ala 
145 150 155 160 

Arg Ser Val Gly Ala Val Ala Tyr Leu Glu Cys Ser Ala Arg Leu His 

165 170 175 

Asp Asn Val Glu Ala Val Phe Gin Glu Ala Ala Glu Val Ala Leu Ser 

180 185 190 

Ser Arg Arg His Asn Phe Trp Arg Arg He Thr Gin Asn Cys Cys Leu 
195 200 205 

Ala Thr 
210 

<210> 347 

<211> 735 

<212> DNA 

<213> Sus scrofa 

<220> 

<221> CDS 

<222> (1> . - (735) 

<400> 347 

atg aag gag aga aga gcc age cag aaa tta tec agt aaa tct ate atg 48 

Met Lys Glu Arg Arg Ala Ser Gin Lys Leu Ser Ser Lys Ser He Met 

1 " 5 10 15 
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gat cct aat cag aac gtg aaa tgc aag ata gta gtg gtg gga gat agt 96 
Asp Pro Asn Gin Asn Val Lys Cys Lys He Val Val Val Gly Asp Ser 

20 25 30 

cag tgt ggg aga acc gcg ctg ctg cac gtc ttc gcc aaa gac tgc ttc 144 
Gin Cys Gly Arg Thr Ala Leu lieu His Val Phe Ala Lys Asp Cys Phe 

35 40 45 

ccc gag aat tac gtc cct aca gtg ttt gag aat tac acg gcc agt ttc 192 
Pro Glu Asn Tyr Val Pro Thr Val Phe Glu Asn Tyr Thr Ala Ser Phe 

50 55 60 

gaa ate gac aca caa aga ata gaa ttg age ctg tgg gac act teg ggt 240 
Glu He Asp Thr Gin Arg He Glu Leu Ser Leu Trp Asp Thr Ser Gly 
65 70 75 80 

tct cct tac tat gac aac gtc egg ccc etc tec tac cea gac tea gac 288 
Ser Pro Tyr Tyr Asp Asn Val Arg Pro Leu Ser Tyr Pro Asp Ser Asp 

85 90 95 

gcc gtg ctg att tgc ttt gac ate agt aga cea gag act ctg gac agt 336 
Ala Val Leu He Cys Phe Asp He Ser Arg Pro Glu Thr Leu Asp Ser 

100' " " 105 lib 

gtc ctg aaa aag tgg aaa ggt gaa ate cag gag ttt tgt ccc aac acc 384 
Val Leu Lys Lys Trp Lys Gly Glu He Gin Glu Phe Cys Pro Asn Thr 

115 120 125 

aaa atg etc ttg gtc ggc tgc aaa tct gat ctt egg aca gat gtc agt 432 
Lys Met Leu Leu Val Gly Cys Lys Ser Asp Leu Arg Thr Asp Val Ser 

130 135 " " 140 

aca tta gta gag etc tea aac cac agg cag aca cea gtt tec tac gac 480 
Thr Leu Val Glu Leu Ser Asn His Arg Gin Thr Pro Val Ser Tyr Asp 
145 150 155 160 

cag ggg gca aat atg gcc aaa cag att gga gca gcc act tac ate gaa 528 
Gin Gly Ala Asn Met Ala Lys Gin He Gly Ala Ala Thr Tyr He Glu 

165 - 170 175 

tgc tea get tta cag tea gaa aat age gtc aga gac att ttt cat gtt 576 
Cys Ser Ala Leu Gin Ser Glu Asn Ser Val Arg Asp He Phe His Val 

180 185 190 

gcc acc ttg gca tgt gta aat aag aca aat aaa aac gtt aag egg aac 624 
Ala Thr Leu Ala Cys Val Asn Lys Thr Asn Lys Asn Val Lys Arg Asn 

195 200 205 

aaa teg cag agg gca aca aag egg att tea cac atg ccc age aga ccg 672 
Lys Ser Gin Arg Ala Thr Lys Arg He Ser His Met Pro Ser Arg Pro 

210 215 220 

gaa etc teg gca gtg get acg gac tta cga aag gac aaa gcc aag age 720 
Glu Leu Ser Ala Val Ala Thr Asp Leu Arg Lys Asp Lys Ala Lys Ser 
225 230 235 240 

tgc act gtg atg tga 735 
Cys Thr Val Met 



<210> 348 
<211> 244 
<212> PRT 
<213> Sus scrofa 



<400> 348 



Met 
1 


Lys 


Glu 


Arg 


Arg Ala Ser Gin 
5 


Asp 


Pro 


Asn 


Gin 


Asn Val Lys Cys 








20 




Gin 


Cys 


Gly 


Arg 


Thr Ala Leu Leu 






35 




40 


Pro 


Glu 


Asn 


Tyr 


Val Pro Thr Val 




50 




55 


Glu 


He 


Asp 


Thr 


Gin Arg He Glu 


65 








70 


Ser 


Pro 


Tyr 


Tyr 


Asp Asn Val Arg 










85 


Ala 


Val 


Leu 


He 


Cys Phe Asp He 








100 




Val 


Leu 


Lys 


Lys 


Trp Lys Gly Glu 



Lys Leu Ser Ser Lys Ser He Met 

10 15 
Lys He Val Val Val Gly Asp Ser 
25 30 
His Val Phe Ala Lys Asp Cys Phe 
45 

Phe Glu Asn Tyr Thr Ala Ser Phe 
60 

Leu Ser Leu Trp Asp Thr Ser Gly 

75 80 
Pro Leu Ser Tyr Pro Asp Ser Asp 

90 95 
Ser Arg Pro Glu Thr Leu Asp Ser 
105 HO 
He Gin Glu Phe Cys Pro Asn Thr 
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115 120 125 

Lys Met Leu Leu Val Gly Cys Lys Ser Asp Leu Arg Thr Asp Val Ser 

130 135 140 

Thr Leu Val Glu Leu Ser Asn His Arg Gin Thr Pro Val Ser Tyr Asp 
145 150 155 160 

Gin Gly Ala Asn Met Ala Lys Gin lie Gly Ala Ala Thr Tyr lie Glu 

165 170 175 

Cys Ser Ala Leu Gin Ser Glu Asn Ser Val Arg Asp lie Phe His Val 

180 185 190 

Ala Thr Leu Ala Cys Val Asn Lys Thr Asn Lys Asn Val Lys Arg Asn 

195 " 200 205 

Lys Ser Gin Arg Ala Thr Lys Arg He Ser His Met Pro Ser Arg Pro 

210 ~ 215 220 

Glu Leu Ser Ala Val Ala Thr Asp Leu Arg Lys Asp Lys Ala Lys Ser 
225 230 235 240 

Cys Thr Val Met 

<210> 349 

<211> 1017 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (30) . . (665) 

<400> 349 

gccgccgcca gtgctgcggg ctccgggca atg gat gcc ccc ggg gcc ctg gcc 53 

Met Asp Ala Pro Gly Ala Leu Ala 
1 5 

cag acc gcc gcc ccc ggt ccg ggc agg aag gag ctg aag ate gtg ate 101 
Gin Thr Ala Ala Pro Gly Pro Gly Arg Lys Glu Leu Lys He Val He 

10 15 20 

gtg ggc gac ggc ggc tgc ggc aag acc teg ctg etc atg gtg tac age 149 
Val Gly Asp Gly Gly Cys Gly Lys Thr Ser Leu Leu Met Val Tyr Ser 
25 30 35 40 

cag ggc tec ttc ccc gag cac tac gcc cca teg gtg ttc gag aag tac 197 
Gin Gly Ser Phe Pro Glu His Tyr Ala Pro Ser Val Phe Glu Lys Tyr 

45 50 55 

acg gcc age gtg acc gtt ggc age aag gag gtg acc ctg aac etc tac 245 
Thr Ala Ser Val Thr Val Gly Ser Lys Glu Val Thr . Leu Asn Leu Tyr 

60 65 70 

gac acg gcc ggg caa gaa gac tat gac egg ctg egg ccc ctg tec tac 293 
Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr 

75 80 85 

cag aac acc cac etc gtg etc ate tgc tat gac gtc atg aat ccc acc 341 
Gin Asn Thr His Leu Val Leu He Cys Tyr Asp Val Met Asn Pro Thr 

90 95 100 

age tac gac aac gtc etc ate aag tgg ttc cct gag gtc acg cat ttc 389 
Ser Tyr Asp Asn Val Leu He Lys Trp Phe Pro Glu Val Thr His Phe 
105 ~ HO H5 120 

tgc cgc ggg ate ccc atg gtg etc ate ggc tgc aag aca gac ctg agg 437 
Cys Arg Gly He Pro Met Val Leu He Gly Cys Lys Thr Asp Leu Arg 

125 130 135 

aag gac aag gag cag ctg egg aag etc egg gcc gcc cag ctg gag ccc 485 
Lys Asp Lys Glu Gin Leu Arg Lys Leu Arg Ala Ala Gin Leu Glu Pro 

140 145 150 

ate acc tac atg cag ggc ctg age gcc tgc gaa cag ate cga get get 533 
He Thr Tyr Met Gin Gly Leu Ser Ala Cys Glu Gin He Arg Ala Ala 

155 160 165 

etc tac ctg gaa tgt tec gcc aag ttt egg gag aat gtg gag gac gtc 581 
Leu Tyr Leu Glu Cys Ser Ala Lys Phe Arg Glu Asn Val Glu Asp Val 

170 175 180 

ttc egg gag gcc gcc aag gtg get etc age get ctg aag aag gcg caa 629 
Phe Arg Glu Ala Ala Lys Val Ala Leu Ser Ala Leu Lys Lys Ala Gin 
185 190 195 200 

egg cag aag aag cgc egg etc tgc ctg ctg etc tgacccaggg cagacagacc 682 
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Arg Gin Lys Lys Arg Arg Leu Cys Leu Leu Leu 

205 210 

tcacgacagc actgacaggg cccgggggcc caggtgccga ttgcaccagg gaggctgccc 742 

catcccgacc ctccagctca tggtgtctgg ggcctgcggc tagactcttg gaacattctg 802 

gaactctctc ctttcctggc tggggctctg accacaaact cccctccagg ctgcccctgg 8 62 

gacatggtgg tgatgtgggt gcaggagcca gtgtctgttg tcgggactcg caagtgccct 922 



catcacagcc acccccacca cgagtgtctc cccagtgcag. actcaagtta tgcttgaaat 982 
gaaaaagtct *~atctggtagt gggtaaaaaa " aaaaa 10i7 



<210> 350 
<211> 211 
<212> PRT 
<213> Homo sapiens 

<400> 350 



Met 


Asp 


Ala 


Pro 


Gly Ala Leu 


Ala 


Gin 


Thr Ala Ala Pro Gly Pro Gly 


1 








5 






10 15 


Arg 


Lys 


Glu 


Leu 


Lys He Val 


He 


Val 


Gly Asp Gly Gly Cys Gly Lys 








20 






25 


30 


Thr 


Ser 


Leu 


Leu 


Met Val Tyr 


Ser 


Gin 


Gly Ser Phe Pro Glu His Tyr 






35 






40 




45 


Ala 


Pro 


Ser 


Val 


Phe Glu Lys 


Tyr 


Thr 


Ala Ser Val Thr Val Gly Ser 




50 






55 






60 


Lys 


Glu 


Val 


Thr 


Leu Asn Leu 


Tyr 


Asp 


Thr Ala Gly Gin Glu Asp Tyr 


65 








70 






75 80 


Asp 


Arg 


Leu 


Arg 


Pro Leu Ser 


Tyr 


Gin 


Asn Thr His Leu Val Leu He 










85 






90 95 


Cys 


Tyr 


Asp 


Val 


Met Asn Pro 


Thr 


Ser 


Tyr Asp Asn Val Leu He Lys 








100 






105 


110 


Trp 


Phe 


Pro 


Glu 


Val Thr His 


Phe 


Cys 


Arg Gly He Pro Met Val Leu 






115 






120 




125 


He 


Gly 


Cys 


Lys 


Thr Asp Leu 


Arg 


Lys 


Asp Lys Glu Gin Leu Arg Lys 




13 0 






135 






140 


Leu 


Arg 


Ala 


Ala 


Gin Leu Glu 


Pro 


He 


Thr Tyr Met Gin Gly Leu Ser 


145 








150 






155 160 


Ala 


Cys 


Glu 


Gin 


He Arg Ala 


Ala 


Leu 


Tyr Leu Glu Cys Ser Ala Lys 










165 






170 175 


Phe 


Arg 


Glu 


Asn 


Val Glu Asp 


Val 


Phe 


Arg Glu Ala Ala Lys Val Ala 








180 






185 


190 


Leu 


Ser 


Ala 


Leu 


Lys Lys Ala 


Gin 


Arg 


Gin Lys Lys Arg Arg Leu Cys 






195 






200 




205 


Leu 


Leu 


Leu 














210 















<210> 351 
<211> 684 
<212> DNA 
<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) . . (684) 

<400> 351 

atg gag ggg cag agt ggc cgc tgc aag ate gta gtg gtg ggg gac gcg 48 
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Met Glu Gly Gin Ser Gly Arg Cys Lys He Val Val Val Gly Asp Ala 

1 5 10 15 

gag tgc ggc aag aca gcg ttg ctg cag gtg ttc gcc aag gac gcc tac 96 

Glu Cys Gly Lys Thr Ala Leu Leu Gin Val Phe Ala Lys Asp Ala Tyr 

20 25 30 

ccc ggg agt tat gtc ccc acg gtg ttt gag aac tac act gcc age ttc 144 
Pro Gly Ser Tyr Val Pro Thr Val Phe Glu Asn .Tyr Thr Ala Ser Phe 

35 40 45 

gag ate gac aag cgc cgc att gag etc aac. atg tgg gat act tea ggt 192 
Glu He Asp Lys Arg Arg He Glu Leu Asn Met Trp Asp Thr Ser Gly 

50 55 60 

tec tct tac tat gac aat gtc egg cct ctg gcc tac ccg gat tct gat 240 
Ser Ser Tyr Tyr Asp Asn Val Arg Pro Leu Ala Tyr Pro Asp Ser Asp 
65 " 70 75 80 

get gtg ctt ate tgc ttt gac att age egg cca gaa aca ctg gac agt 288 
Ala Val Leu He Cys Phe Asp He Ser Arg Pro Glu Thr Leu Asp Ser 

85 ?0 95 

gtc "etc aag aag tgg" caa gga gag act cag gag ttt tgc ccc aat gcc 336 
Val Leu Lys Lys Trp Gin Gly Glu Thr Gin Glu Phe Cys Pro Asn Ala 

100 105 HO 

aag gtg gtg ctg gtt ggc tgt aag ctg gac atg egg act gac ctg gcc 384 
Lys Val Val Leu Val Gly Cys Lys Leu Asp Met Arg Thr Asp Leu Ala 

115 120 125 

aca ctg agg gag eta tec aag cag aga etc ate cct gtc aca cat gag 432 
Thr Leu Arg Glu Leu Ser Lys Gin Arg Leu He Pro Val Thr His Glu 

130 135 140 

cag ggt act gtg ctg gcc aag caa gtg gga get gtg tec tac gtt gaa 
Gin Gly Thr Val Leu Ala Lys Gin Val Gly Ala Val Ser Tyr Val Glu 
145 ~ 150 155 160 

tgt tec tec cga tct tct gag cgc agt gtc egg gat gtc ttc cat gtg 
Cys Ser Ser Arg Ser Ser Glu Arg Ser Val Arg Asp Val Phe His Val 

165 170 175 

gcc aca gtg get tct etc ggc cgt ggc cat agg cag eta cgt cgt act 576 
Ala Thr Val Ala Ser Leu Gly Arg Gly His Arg Gin Leu Arg Arg Thr 

180 185 190 

gac tct cgc egg gga ctg cag cga tec act caa ctg teg gga egg cca 624 
Asp Ser Arg Arg Gly Leu Gin Arg Ser Thr Gin Leu Ser Gly Arg Pro 

195 200 205 

gac egg gga aat gag ggc gag atg cat aag gat cga gcc aag age tgt 672 
Asp Arg Gly Asn Glu Gly Glu Met His Lys Asp Arg Ala Lys Ser Cys 

210 215 220 

aac etc atg tga 684 
Asn Leu Met 
225 



480 



528 



<210> 352 
<211> 227 
<212> PRT 

<213> Mus musculus 



<400> 352 



Met 


Glu 


Gly 


Gin 


Ser Gly Arg Cys 


1 








5 




Glu 


Cys 


Gly 


Lys 


Thr Ala Leu 


Leu 








20 






Pro 


Gly 


Ser 


Tyr 


Val Pro Thr 


Val 




35 






40 


Glu 


He 


Asp 


Lys 


Arg Arg He 


Glu 




50 






55 




Ser 


Ser 


Tyr 


Tyr 


Asp Asn Val 


Arg 


65 








70 




Ala 


Val 


Leu 


He 


Cys Phe Asp 


He 










85 




Val 


Leu 


Lys 


Lys 


Trp Gin Gly Glu 








100 






Lys 


val 


Val 


Leu 


Val Gly Cys 


Lys 




115 






120 



Lys He Val Val Val Gly Asp Ala 

10 15 
Gin val Phe Ala Lys Asp Ala Tyr 
25 30 
Phe Glu Asn Tyr Thr Ala Ser Phe 
45 

Leu Asn Met Trp Asp Thr Ser Gly 
60 

Pro Leu Ala Tyr Pro Asp Ser Asp 

75 80 
Ser Arg Pro Glu Thr Leu Asp Ser 

90 95 
Thr Gin Glu Phe Cys Pro Asn Ala 
105 HO 
Leu Asp Met Arg Thr Asp Leu Ala 
125 



WO 2005/014828 



Thr 


Leu 
130 


Arg 


Glu 


Leu 


Ser 


Gin 


Gly 


Thr 


val 


Leu 


Ala 


145 










150 


Cys 


Ser 


Ser 


Arg 


Ser 


Ser 








165 




Ala 


Thr 


Val 


Ala 
180 


Ser 


Leu 


Asp 


Ser 


Arg 
195 


Arg 


Gly 


Leu 


Asp 


Arg 
210 


Gly 


Asn 


Glu 


Gly 


Asn 


Leu 


Met 









225 



<210> 353 
<211> 576 
"<212> DNA. 

<213> Brachydanio rerio 

<220> 

<221> CDS 

<222> (1) . . (576) 



<400> 353 



atg 


cag 


acg 


att 


aag 


tgt 


Met 


Gin 


Thr 


He 


Lys 


Cys 


1 








5 




aca 


tgc 


ctt 


ctt 


att 


tct 


Thr 


Cys 


Leu 


Leu 


1-le 


Ser 








20 






att 


ccc 


aca 


gtg 


ttt 


gac 


He 


Pro 


Thr 


Val 


Phe 


Asp 






35 








cgc 


act 


gtc 


age 


etc 


aac 


Arg 


Thr 


val 


Ser 


Leu 


Asn 




50 










gac 


cgt 


ctg 


cgc 


acg 


ctt 


Asp 


Arg 


Leu 


Arg 


Thr 


Leu 


65 










70 


tgc 


ttt 


tec 


ate 


gga 


age 


Cys 


Phe 


Ser 


lie 


Gly 


Ser 










85 




tgg 


cac 


ccg 


gag 


gtg 


tct 


Trp 


His 


Pro 


Glu 


Val 


Ser 






100 






gtt 


ggc 


acc 


aag 


aag 


gat 


Val 


Gly 


Thr 


Lys 


Lys 


Asp 






115 








«g 


aag 


gag 


caa 


ggg 


ctt 


Leu 


Lys 


Glu 


Gin 


Gly 


Leu 




13 0 










ctg 


tgc 


aag 


cag 


ate 


aat 


Leu 


Cys 


Lys 


Gin 


He 


Asn 


145 










150 


cgc 


cag 


gaa 


ggc 


gtg 


egg 


Arg 


Gin 


Glu 


Gly 


Val 


Arg 










165 




etc 


tac 


ccc 


atg 


acc 


aaa 


Leu 


Tyr 


Pro 


Met 


Thr 


Lys 



180 



tag 



<210> 354 
<211> 191 
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Lys 


Gin 


Arg 


Leu 


He 


Pro 


135 










140 


Lys 


Gin 


Val 


Gly 


Ala 


Val 










155 




Glu 


Arg 


Ser 


Val 


Arg 


Asp 






170 






Gly 


Arg 


Gly 


His 


Arg 


.Gin 






185 








Gin 


Arg 


Ser 


Thr 


Gin 


Leu 




200 










Glu 


Met 


His 


Lys 


Asp 


Arg 


215 

... 








•• 


220 


gtg 


gtg 


gtg 


gga 


gac 


ggt 


Val 


Val 


Val 


Gly 


Asp 


Gly 








10 






tat 


acg 


aca 


aat 


gee 


ttt 


Tyr 


Thr 


Thr 


Asn 


Ala 


Phe 




25 








aac 


tac 


agt 


get 


cag 


atg 


Asn 


Tyr 


Ser 


Ala 


Gin 


Met 




40 










etc 


tgg 


gac 


acg 


gca 


ggg 


Leu 


Trp 


Asp 


Thr 


Ala 


Gly 


55 










60 


tec 


tac 


cca 


caa 


act 


aat 


Ser 


Tyr 


Pro 


Gin 


Thr 


Asn 










75 




cct 


tea 


tea 


tta 


get 


aat 


Pro 


Ser 


Ser 


Leu 


Ala 


Asn 








90 






cac 


cac 


tgt 


ccc 


aac 


gtg 


His 


His 


Cys 


Pro 


Asn 


val 






105 








ctg 


cgt 


tea 


gat 


aca 


gag 


Leu 


Arg 


Ser 


Asp 


Thr 


Glu 




120 










gca 


ccc 


tct 


act 


caa 


cag 


Ala 


Pro 


Ser 


Thr 


Gin 


Gin 


135 










140 


get 


gtg 


agg 


tat 


ctg 


gag 


Ala 


Val 


Arg 


Tyr 


Leu 


Glu 










155 




gat 


gtg 


ttt 


gta 


gat 


get 


Asp 


val 


Phe 


Val 


Asp 


Ala 








170 






aag 


aat 


acc 


aag 


aag 


tgt 


Lys 


Asn 


Thr 

1 QC 


Lys 


Lys 


Cys 



185 
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Val Thr 


His 


Glu 




Ser Tyr 


Val 


Glu 








160 




Val Phe 


His 


Val 






175 






Leu Arg 


Arg 


Thr 




190 








Ser Gly 


Arg 


Pro 




205 








Ala Lys 

- 


Ser 

- 


Cys 


- 


gca gta 


ggc 


aag 


48 


Ala Val 


Gly 


Lys 






15 






cca gag 


gag 


tac 


96 


Pro Glu 


Glu 


Tyr 




30 








agt *.gta 


gat 


ggg 


144 


Ser Val 


Asp 


Gly 




45 








cag gag 


gag 


tat 


192 


Gin Glu 


Glu 


Tyr 




gtt ttc 


ate 


att 


240 


Val Phe 


He 


He 








80 




gtt cgc 


cac 


aag 


288 


Val Arg 


His 


Lys 






95 






cca att 


ctt 


eta 


336 


Pro He 


Leu 


Leu 




110 








aca att 


aag 


aag 


384 


Thr He 


Lys 


Lys 




125 








cag ggt 


ggc 


acc 


432 


Gin Gly 


Gly 


Thr 




tgc teg 


gcg 


etc 


480* 


Cys Ser 


Ala 


Leu 








160 




gtt cgt 


get 


gtg 


528 


Val Arg 


Ala 


Val 






175 






gtt etc 


tta 




573 


Val Leu 


Leu 






190 









576 
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PCT/EP2004/008136 



<212> PRT 

<213> Brachydanio rerio 
<400> 354 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala .Phe Pro Glu Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Gin Met Ser Val Asp Gly 

35 40 45 

Arg Thr Val Ser Leu Asn Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr 

50 55 60 

Asp Arg Leu Arg Thr Leu Ser Tyr Pro Gin Thr Asn Val Phe He He 
65 70 75 80 

Cys Phe Ser He Gly Ser Pro Ser Ser Leu Ala Asn Val Arg His Lys 

85 90 95 

Trp His Pro Glu Val Ser His His Cys Pro Asn Val Pro He Leu Leu 
-- 100 105 no 

Val Gly Thr Lys Lys Asp Leu Arg Ser Asp Thr Glu Thr He Lys Lys 

115 120 125 

Leu Lys Glu Gin Gly Leu Ala Pro Ser Thr Gin Gin Gin Gly Gly Thr 

130 135 140 

Leu Cys Lys Gin He Asn Ala Val Arg Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Arg Gin Glu Gly Val Arg Asp Val Phe Val Asp Ala Val Arg Ala Val 

165 ~ 170 175 

Leu Tyr Pro Met Thr Lys Lys Asn Thr Lys Lys Cys Val Leu Leu 
180 " 185 190 

<210> 355 " 
<211> 576 
<212> DNA 

<213> Xenopus laevis 

<220> 

<221> CDS 

<222> (1)..(576) 

<400> 355 

atg cag age att aag tgt gtg gtg gtg ggc gac gga gca gtg gga aag 48 
Met Gin Ser He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
15 10 15 

acc tgt tta etc ate tgc ttc aca acc aac gec ttc cct aag gaa tac 96 
Thr Cys Leu Leu He Cys Phe Thr Thr Asn Ala Phe Pro Lys Glu Tyr 

20 25 30 

ate ccc aca gtg ttt gat aac tac age gec cag acg gca gtt gac ggg 144 
He Pro Thr Val Phe Asp Asn Tyr Ser Ala Gin Thr Ala Val Asp Gly 

35 40 45 

agg aca gtt age ttg aac ctg tgg gac act get ggc cag gaa gag tat 192 
Arg Thr Val Ser Leu Asn Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr 

50 55 60 

gac aga ctg cgt aca ctt tct tac cca cag acc aac gtc ttc ate att 240 
Asp Arg Leu Arg Thr Leu Ser Tyr Pro Gin Thr Asn Val Phe He He 
65 " ~ 70 75 80 

tgc ttt tec ate gec age cca acc tec tat gag aat gta aag cac aag 288 
Cys Phe Ser He Ala Ser Pro Thr Ser Tyr Glu Asn Val Lys His Lys 

85 90 95 

tgg tac cct gag gtg ggt cac cat tgt ccc aac gtg ccc att etc ttg 336 
Trp Tyr Pro Glu Val Gly His His Cys Pro Asn Val Pro He Leu Leu 

100 ~ 105 HO 

gtg ggc acc aag aag gat ctt agg aac aac gca gat gtc ata aag aag 384 
Val Gly Thr Lys Lys Asp Leu Arg Asn Asn Ala Asp Val He Lys Lys 

115 - 120 125 

ctg aaa gag cag aac caa atg ccc ate act aac cat cag ggt gga aat 432 
Leu Lys Glu Gin Asn Gin Met Pro He Thr Asn His Gin Gly Gly Asn 

130 135 140 

ctg gec aag cag ate cat gca gta aag tat atg gag tgc teg gcg ctg 
Leu Ala Lys Gin He His Ala Val Lys Tyr Met Glu Cys Ser Ala Leu 



480 
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145 150 155 160 

aat caa gat ggc ate aag gaa gtg ttt gcg gat get gtg cga get gtt 528 

Asn Gin Asp Gly He Lys Glu Val Phe Ala Asp Ala Val Arg Ala Val 

165 170 175 

etc aac ccg act ccc ate aag gac aaa aag age tgc ttc att ttg 573 
Leu Asn Pro Thr Pro He Lys Asp Lys Lys Ser Cys Phe He Leu 

180 185 190 

tga 576 



<210> 356 
<211> 191 
<212> PRT 

<213> Xenopus laevis 
<400> 356 

Met Gin Ser He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Cys Phe Thr Thr Asn Ala Phe Pro Lys Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Gin Thr Ala Val Asp Gly 

35 40 45 

Arg Thr Val Ser Leu Asn Leu Trp Asp Thr Ala Gly Gin Glu Glu Tyr 

50 55 60 

Asp Arg Leu Arg Thr Leu Ser Tyr Pro Gin Thr* Asn Val Phe He He 
65 70 75 80 

Cys Phe Ser He Ala Ser Pro Thr Ser Tyr Glu Asn Val Lys His Lys 

85 90 95 

Trp Tyr Pro Glu Val Gly His His Cys Pro Asn Val Pro He Leu Leu 

100 105 110 

Val Gly Thr Lys Lys Asp Leu Arg Asn Asn Ala Asp Val He Lys Lys 

115 120 125 

Leu Lys Glu Gin Asn Gin Met Pro He Thr Asn His Gin Gly Gly Asn 

130 135 140 

Leu Ala Lys Gin He His Ala Val Lys Tyr Met Glu Cys Ser Ala Leu 
145 150 155 160 

Asn Gin Asp Gly He Lys Glu Val Phe Ala Asp Ala Val Arg Ala Val 

165 170 175 

Leu Asn Pro Thr Pro lie Lys Asp Lys Lys Ser Cys Phe He Leu 
180 185 190 

<210> 357 
<211> 591 
<212> DNA 

<213> Xenopus tropicalis 

<220> 

<221> CDS 

<222> (1)..(591) 

<400> 357 

atg gee gcg ata cgc aag aag ctg gtg gtg gtc ggg gac gga get tgc 48 

Met Ala Ala He Arg Lys Lys Leu Val Val Val Gly Asp Gly Ala Cys 

1 5 10 15 

ggc aaa ace tgc etc etc ate gtc ttc age aag gac gag ttc ccc gag 96 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Asp Glu Phe' Pro Glu 

20 25 30 

gtg tac gtt ccc ace gtg ttc gag aac tac gtg gee gat ate gag gtg 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp He Glu Val 

35 40 45 

gac gtg aag cag gtg gag ctg gec ctg tgg gac acg gcg ggg cag gag 192 
Asp Val Lys Gin Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gac tac gac egg ctg egg cct etc tec tac ccc gac acg gac gtc ate 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Thr Asp Val He 
65 70 75 80 
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etc 


atg 


tgc 


ttc 


tec 


gtg 


Leu 


Met 


Cys 


Phe 


Ser 


Val 








85 




gag 


aag 


tgg 


gtg 


ccc 


gag 


Glu 


Lys 


Tip 


Val 


Pro 


Glu 








100 






ate 


ctg 


gtg 


gec 


aac 


aag 


lie 


Leu 


Val 


Ala 


Asn 


Lys 






115 








aac 


gag 


ctg 


gec 


cgc 


atg 


Asn 


Glu 


Leu 


Ala 


Arg 


Met 




13 0 










agg 


gec 


atg 


gee 


gtg 


agg 


Arg 


Ala 


Met 


Ala 


Val 


Arg 


145 










150 


gec 


aag 


acc 


aag 


gaa 


ggg 


Ala 


Lys 


Thr 


Lys 


Glu 


Gly 










165 




gcg 


gcg 


ctg 


cag 


aag 


aaa 


Ala 


Ala 


Leu 


Gin 


Lys 


Lys 








180 






tgt 


aag 


etc 


etc 


tga 




Cys 


Lys 


Leu 


Leu 







195 



264/291 

gac age ccg gac teg ctg 
Asp Ser Pro Asp Ser Leu 
90 

gtg aag cac ttc tgc ccc 
Val Lys His Phe Cys Pro 
105 

aag gac ctg agg aac gac 
Lys Asp Leu Arg Asn Asp 
120 

aag cag gag ccg gtc cgc 
Lys Gin Glu Pro Val Arg 
135 140 
att aac gec ttc gag tac 
lie Asn Ala Phe Glu Tyr 
155 

gtc egg gag gtg ttc gaa 
Val Arg Glu Val Phe Glu 
170 

cac ggc egg age ggt gaa 
His Gly Arg Ser Gly Glu 
185 



PCT/EP2004/008136 



gag aac ate ccg 288 
Glu Asn lie Pro 
95 

age gtc ccc ate 336 
Ser Val Pro lie 
110 

gag cac gtc cgc 384 

Glu His Val Arg 

125 

acc gag gac ggc 432 
Thr Glu Asp Gly 

ctg gag tgc teg 480 
Leu Glu Cys Ser 
160 

acg gee act agg 528 
Thr Ala Thr Arg 

""175 

tgt atg age tgc 576 
Cys Met Ser Cys 
190 

591 



<210> 358 
<211> 196 
<212> PRT 

<213> Xenopus tropicalis 
<400> 358 



Met 


Ala 


Ala 


lie 


Arg 


Lys 


Lys 


Leu Val Val Val Gly Asp Gly Ala Cys 


1 








5 






10 15 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


lie 


Val Phe Ser Lys Asp Glu Phe Pro Glu 




20 








25 30 


Val 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Glu Asn Tyr Val Ala Asp lie Glu Val 




35 










40 45 


Asp 


Val 


Lys 


Gin 


Val 


Glu 


Leu 


Ala Leu Trp Asp Thr Ala Gly Gin Glu 


50 










55 


60 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu Ser Tyr Pro Asp Thr Asp Val lie 


65 










70 




75 80 


Leu 


Met 


Cys 


Phe 


Ser 


Val 


Asp 


Ser Pro Asp Ser Leu Glu Asn lie Pro 








85 






90 95 


Glu 


Lys 


Trp 


Val 


Pro 


Glu Val 


Lys His Phe Cys Pro Ser Val Pro lie 






100 








105 110 


lie 


Leu 


Val 
115 


Ala 


Asn 


Lys 


Lys 


Asp Leu Arg Asn Asp Glu His Val Arg 
120 125 


Asn 


Glu 
130 


Leu 


Ala 


Arg 


Met 


Lys 
135 


Gin Glu Pro Val Arg Thr Glu Asp Gly 
140 


Arg 


Ala 


Met 


Ala 


Val 


Arg 


lie 


Asn Ala Phe Glu Tyr Leu Glu Cys Ser 


145 










150 




155 160 


Ala 


Lys 


Thr 


Lys 


Glu 


Gly Val 


Arg Glu Val Phe Glu Thr Ala Thr Arg 








165 






170 175 


Ala 


Ala 


Leu 


Gin 
180 


Lys 


Lys 


His 


Gly Arg Ser Gly Glu Cys Met Ser Cys 
185 190 


Cys 


Lys 


Leu 
195 


Leu 











<210> 359 
<211> 735 
<212> DNA 

<213> Xenopus laevis 



<220> 

<221> CDS 

<222> (1)..(735) 
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<400> 359 




















atg 


aag 


gaa 


aga aga acg 


agt 


cag 


aaa 


ctg 


gec 


age aaa tea atg 


atg 


48 


Met 


Lys 


Glu 


Arg Arg Thr Ser Gin Lys 


Leu 


Ala 


Ser Lys Ser Met 


Met 




1 






5 








10 




15 






gat 


ccc 


aac 


cag aat gtg 


aaa 


tgt 


aaa 


att 


gtg 


gtg gtg ggg gac 


age . 


96 


Asp 


Pro 


Asn 


Gin Asn Val Lys 


Cys 


Lys 


lie 


val 


Val Val Gly Asp 


Ser 










20 






25 






30 






cag 


tgt 


ggc 


aaa act gec 


ctg 


etc 


cat 


gtc 


ttt 


gee aag gac tec 


ttc 


144 


Gin 


Cys 


Gly 


Lys Thr Ala 


Leu 


Leu 


His 


Val. Phe 


Ala Lys Asp Ser. 


Phe 








35 






40 








45 






cca 


gag 


aac 


tat gtc ccc 


acc 


gta 


ttt 


gag 


aat 


tac acg gec agt 


ttt 


192 


Pro 


Glu 


Asn 


Tyr Val Pro Thr Val 


Phe 


Glu 


Asn 


Tyr Thr Ala Ser 


Phe 






50 






55 










60 






gaa 


att 


gac 


acg cag agg 


ata 


gaa 


ctg 


agt 


etc 


tgg gat aca tct 


ggt 


240 


Glu 


He 


Asp 


Thr Gin Arg 


He 


Glu 


Leu 


Ser 


Leu 


Trp Asp Thr Ser 


Gly 




65 






70 










75 




80 




tec 


cct 


.tat 


tat gac aac 


gtc 


cga 


jp.ct 


ctt 


age 


tat cca gac tct 


gac 


288. 


Ser 


Pro 


Tyr 


Tyr Asp Asn Val Arg Pro 


Leu 


Ser 


Tyr Pro Asp Ser 


Asp 










85 








90 




95 






gec 


gtt 


ttg 


ata tgt ttc 


gat 


ate 


agt 


cgc 


ccc 


gag acg ctt gac 


agt 


336 


Ala 


Val 


Leu 


He Cys Phe Asp 


He 


Ser 


Arg 


Pro 


Glu Thr Leu Asp 


Ser 










100 






105 






110 






gtt 


ctt 


aaa 


aag tgg aaa 


ggt 


gaa 


att 


caa 


gaa 


ttc tgc ccc aat 


acc 


384 


Val 


Leu 


Lys 


Lys Trp Lys 


Gly Glu He 


Gin 


Glu 


Phe Cys Pro Asn 


Thr 








115 






120 








125 






aaa 


atg 


ctg 


ctg gtt ggc 


tgc 


aaa 


tec 


gac 


etc 


cgt aca gat eta 


acc 


432 


Lys 


Met 


Leu 


Leu Val Gly Cys 


Lys 


Ser 


Asp 


Leu 


Arg Thr Asp Leu 


Thr 






130 






135 










140 






aca 


tta 


gta 


gaa eta tct 


aac 


cac 


aga 


cag 


acg 


cca gtc tec tat 


gat 


480 


Thr 


Leu 


Val 


Glu Leu Ser Asn His Arg 


Gin 


Thr 


Pro Val Ser Tyr 


Asp 




145 






150 










155 


160 




cag 


ggg 


gca 


aac atg gec 


aag 


cag 


ata 


gga 


gca 


gec acg tac ate 


gaa 


528 


Gin Gly 


Ala 


Asn Met Ala Lys 


Gin 


lie 


Gly Ala 


Ala Thr Tyr He 


Glu 










165 








170 




175 






tgc 


tct 


gcg 


tta cag tct 


gag 


aac 


agt 


gtc 


aga 


gac att ttc cat 


gta 


576 


Cys 


Ser 


Ala 


Leu Gin Ser 


Glu 


Asn 


Ser 


Val Arg 


Asp He Phe His 


Val 








180 






185 






190 






gec 


acc 


ctg 


gca tgt gtg 


aat 


aag 


act 


aac 


aaa 


aat ctt aaa agg 


aat 


624 


Ala 


Thr 


Leu 


Ala Cys Val 


Asn 


Lys 


Thr 


Asn Lys 


Asn Leu Lys Arg 


Asn 








195 






200 








205 






aaa 


act 


caa 


aga gee aca 


aag 


aga 


ate 


teg 


cac 


atg ccg agt egg 


cca 


672 


Lys 


Thr 


Gin 


Arg Ala Thr 


Lys 


Arg He 


Ser His 


Met Pro Ser Arg 


Pro 






210 






215 










220 






gaa 


eta 


agt 


get gtg gec 


aca 


gac 


ttg 


agg 


aag 


gat aaa gca aag 


age 


720 


Glu 


Leu 


Ser 


Ala Val Ala 


Thr 


Asp 


Leu 


Arg Lys 


Asp Lys Ala Lys 


Ser 




225 






230 










235 




240 




tgc 


age 


ate 


atg tga 
















735 


Cys 


Ser 


lie 


Met 


















<210> 360 




















<211> 244 




















<212> PRT 




















<213> Xenopus laevis 


















<400> 360 




















Met 


Lys 


Glu 


Arg Arg Thr 


Ser Gin Lys 


Leu 


Ala 


Ser Lys Ser Met 


Met 




1 






5 








10 




15 






Asp 


Pro 


Asn 


Gin Asn Val 


Lys 


Cys 


Lys 


He 


Val 


Val Val Gly Asp 


Ser 










20 






25 






30 






Gin 


Cys 


Gly 


Lys Thr Ala 


Leu 


Leu His 


Val 


Phe 


Ala Lys Asp Ser 


Phe 








35 






40 








45 






Pro 


Glu 


Asn 


Tyr Val Pro Thr Val 


Phe 


Glu Asn 


Tyr Thr Ala Ser 


Phe 






50 






55 










60 






Glu 


He Asp 


Thr Gin Arg 


He 


Glu 


Leu 


Ser 


Leu 


Trp Asp Thr Ser 


Gly 




65 






70 










75 




80 




Ser 


Pro 


Tyr 


Tyr Asp Asn Val Arg Pro 


Leu 


Ser 


Tyr Pro Asp Ser 


Asp 
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85 90 95 



Ala 


Val 


Leu 


lie 


Cys 


Phe 


Asp 


lie 


Ser Arg 


Pro 


Glu 


Thr 


Leu Asp Ser 








100 








105 








110 


Val 


Leu 


Lvs 


Lys 


Trp 


Lys 


Gly 


Glu 


lie Gin 


Glu 


Phe 


Cys 


Pro Asn Thr 






115 








120 








125 


Asp Leu Thr 


Lys 


Met 


Leu 


Leu 


Val 


Gly 


Cys 


Lys 


Ser Asp 


Leu 


Arg 


Thr 


130 










135 








.140 






Thr 


Leu 


Val 


Glu 


Leu 


Ser 


Asn 


His 


Arg Gin 


Thr 


Pro 


Val 


Ser Tyr Asp 


145 










150 








155 






160 


Gin 


Glv 


Ala 


Asn 


Met 


Ala 


Lys 


Gin 


lie Gly 


Ala 


Ala 


Thr 


Tyr He Glu 








165 








170 








175 


Cys 


Ser 


Ala 


Leu 


Gin 


Ser 


Glu 


Asn 


Ser Val 


Arg 


Asp 


He 


Phe His Val 






180 










185 








190 


Ala 


Thr 


Leu 


Ala 


Cys 


Val 


Asn 


Lys 


Thr Asn 


Lys 


Asn 


Leu 


Lys Arg Asn 






195 








200 








205 




Lys 


Thr 


Gin 


Arg 


Ala 


Thr 


Lys 


Arg 


He Ser 


His 


Met 


Pro 


Ser Arg Pro 


210 








215 








220 






"Glu 


"Leu 


Ser 


Ala 


Val 


Ala 


Thr 


Asp 


Leu' Arg 


Lys 


Asp 


Lys 


Ala Lys Ser 


225 










230 








235 






240 


Cys 


Ser 


lie 


Met 





















<210> 361 

<211> 1035 

<212> DNA 

<213> Pagrus major 

<220> 

<221> CDS 

<222> (25) . . (603) 

<400> 361 

cqqcacqagg cgcagcggaa cagg atg cag get ate aaa tgt gtg gtc gtg 51 

Met Gin Ala He Lys Cys Val Val Val 
1 5 

gga gat gga get gtg gga aaa aca tgt ctt etc ate age tac aea acc 99 

Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr 

10 " 15 20 25 

aat gee ttc ccc ggg gag tac att cct act gta ttt gac aac tac tea 147 

Asn Ala Phe Pro Gly Glu Tyr He Pro Thr Val Phe Asp Asn Tyr Ser 

30 35 40 

get aat gtg atg gtg gac age aag cca gtc aac ctg ggc etc tgg gat 195 
Ala Asn Val Met Val Asp Ser Lys Pro Val Asn Leu Gly Leu Trp Asp 

45 50 55 

aca get gga cag gaa gac tac gac agg etc cgc cca ctg tec tac cca 243 
Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro 

60 65 70 

cag acg gat gtt ttc ctg ate tgt ttc tec ctg gtg age cca gcg tec 291 
Gin Thr Asp Val Phe Leu He Cys Phe Ser Leu Val Ser Pro Ala Ser 

75 ^ 80 85 

tat gag aac gtc aga get aag tgg tac cca gag gtt cgc cac cac tgc 339 
Tyr Glu Asn Val Arg Ala Lys Trp Tyr Pro Glu Val Arg His His Cys 
90 95 100 105 

ccc tec aca cct ate ate ctg gtg ggc acc aag ctg gat ctg agg gac 387 
Pro Ser Thr Pro He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp 

110 115 120 

gag aag gaa acc att gag aag ctg aag gaa aaa aag ctg gcg ccg ate 435 
Glu Lys Glu Thr He Glu Lys Leu Lys Glu Lys Lys Leu Ala Pro He 

125 130 135 

acc tac ccc cag ggc ctg get ctg get aag gag ate gat gca gtg aag 483 
Thr Tyr Pro Gin Gly Leu Ala Leu Ala Lys Glu He Asp Ala Val Lys 

140 ~ 145 150 

tac ctg gag tgc tea gee ttg acc cag cgt ggc ctg aag acc gtg ttc 531 
Tyr Leu Glu Cys Ser Ala Leu Thr Gin Arg Gly Leu Lys Thr Val Phe 

155 160 165 

gac gag gee ate egg gec gtg etc tgc cct cag ccc acc aag gtc aag 579 
Asp Glu Ala He Arg Ala Val Leu Cys Pro Gin Pro Thr Lys Val Lys 
170 175 180 185 
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aag aag ccg tgc tea ctg ctg taaatgattc ttgcaagacg aagacgacga 630 
Lys Lys Pro Cys Ser Leu Leu 
190 

cgatgagaaa gcgtttgcga ctgatgaaaa ggtgaaaaga agacgatggg aaagttctgt 690 

tatgtttaat ggtttgtgat agaagactgc gactagattt ccactaggtt agaaaaaaat 750 

aatgcttgtg eggtgatgat aaatggtaaa tggtagcaaa ccctttctgt acagcatact 810 

ctgagactga ttattcaagt gcattataga ccaggcagac atttcactgt aaaattgaac 870 

aaagtatatc actagatgtg gaggctgaca cactgtacat tcatcctgtg ggcttaaatg 930 

gtcagatcaa gagcagcaaa gcacctgcac aaeggactet gctgtccctt ttggggctga 990 

cagtatgtac acaggtgtct tgtgcggctt cacataatga taata 1035 



<210> 362 
<211> 192 
<212> PRT 

<213> Pagrus major 
<400> 362 

Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

lie Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ser 

35 40 45 

Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu lie 
65 70 75 1 ~ 80 

Cys Phe Ser Leu Val Ser Pro Ala Ser Tyr Glu Asn Val Arg Ala Lys 

85 .90 95 

Trp Tyr Pro Glu Val Arg His His Cys Pro Ser Thr Pro lie lie Leu 

100 105 110 

Val Gly Thr Lys Leu Asp Leu Arg Asp Glu Lys Glu Thr lie Glu Lys 

115 120 125 

Leu Lys Glu Lys Lys Leu Ala Pro He Thr Tyr Pro Gin Gly Leu Ala 

130 135 140 

Leu Ala Lys Glu He Asp Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 160 

Thr Gin Arg Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Ala Val 

165 170 175 

Leu Cys Pro Gin Pro Thr Lys Val Lys Lys Lys Pro Cys Ser Leu Leu 
180 - 185 190 

<210> 363 
<211> 1090 
<212> DNA 

<213> Schizophyllum commune 

<220> 

<221> CDS 

<222> (145) . . (735) 



<400> 363 

gctgctcgcc ttgttcttcc attgtccatc cccttgcccg tgctcccgcg gtegctgaac 60 
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gtacccaagc cctcgcctcc ccgtcgcgtc gtctgccttc ttcccctcgg tttccgatag 120 

tcgctccgtt gcccagtccc cagt atg cag gcc ate aag tgt gtt gtc gta 171 

Met Gin Ala He Lys Cys Val Val Val 
1 5 

gga gat ggt gcg gtc gga aag acc tgc ctg. eta ate teg tat acc acg 219 

Gly Asp Gly Ala Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Thr 

10 " 15 20 25 

aac gcg ttc ccg gga gaa tat ate ccg acc gtg ttc gat aac tac tec 267 

Asn Ala Phe Pro Gly Glu Tyr He Pro Thr Val Phe Asp Asn Tyr Ser 

30 35 40 

gcc aat gtc atg gtc gac ggc aag act ate tec etc ggg ctt tgg gat 315 
Ala Asn Val Met Val Asp Gly Lys Thr He Ser Leu Gly Leu Trp Asp 

45 50 55 

acc get ggt caa gaa gat tac gac cgt etc cgc ccg etc tec tac cct 363 
Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro 

60 65 70 

cag acg gat gtc ttc ttg att tgt ttc teg etc gtc age ccg cca agt 411 
Gin Thr Asp Val Phe Leu He Cys Phe Ser Leu Val Ser Pro Pro Ser 

75 80 85 

ttc gag aac gtc egg acc aag tgg tac cct gaa ata teg cat cac gca 459 
Phe Glu Asn Val Arg Thr Lys Trp Tyr Pro Glu He Ser His His Ala 
90 95 100 105 

ccg cag acg ccc gtc gtg etc gtg ggc acc aag ctg gat ttg cga gag 507 
Pro Gin Thr Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu 

110 115 120 

gac cct gcg acg ata gag aaa ctg cgt gac cgc cgc atg tec ccc ate 555 
Asp Pro Ala Thr He Glu Lys Leu Arg Asp Arg Arg Met Ser Pro He 

-.125 130 135 

cag tac teg cag ggt gtc gcg atg atg aag gac ate ggt get gtg aag 603 
Gin Tyr Ser Gin Gly Val Ala Met Met Lys Asp He Gly Ala Val Lys 

140 145 150 

tac eta gag tgt tec gcg ctg acg caa aag ggg etc aag acc gtg ttt 651 
Tyr Leu Glu Cys Ser Ala Leu Thr Gin Lys Gly Leu Lys Thr Val Phe 

155 " 160 165 

gac gag gcg ate cgt gtt gtc ttg tac ccg tec gcg egg tec gac aac 699 
Asp Glu Ala He Arg Val Val Leu Tyr Pro Ser Ala Arg Ser Asp Asn 
170 175 180 185 

aaa cgc age aag ggc cgc tea tgc att gtc gca taagtggact ccagcgccgc 752 
Lys Arg Ser Lys Gly Arg Ser Cys He Val Ala 

190 195 
tgtatgggtg gacgaaccac aggaagtgtt gtgcgcactg tatcatcacg cggcgcgcgg 812 

cgctccctgc actatatcac gecatttatg tctttcttct agcactatct cctttttcgc 872 

gccagctgct gatatccgea cggtttcctg ccaacacata ccagtggaac tcgaggatcc 932 

. tgaccatttt gctactcgtc acgaccaccc ttgcctcaca ccccctttgt gttgactctc 992 

aggttctgta attcgacgac ctgctcttcg aatttcgetc ccctttgccc acatagcaca 1052 

tatagtatca ccgtgacgaa ctgtatcctc cttgacga 1090 



<210> 364 
<211> 196 
<212> PRT 

<213> Schizophyllum commune 
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<400> 364 

Met Gin Ala lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

15 io 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

Lys Tnr He Ser Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 * 80 

Cys Phe Ser Leu Val Ser Pro Pro Ser Phe Glu Asn Val Arg Thr Lys 

85 90 95 

Trp Tyr Pro Glu He Ser His His Ala Pro Gin Thr Pro Val Val Leu 

100 105 no 

Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Pro Ala Thr He Glu Lys 

115 120 125 

Leu Arg Asp Arg Arg Met Ser Pro He Gin Tyr Ser Gin Gly Val Ala 

130 135 140 

Met Met Lys Asp He Gly Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu 
145 150 * 155 160 

Thr Gin Lys Gly Leu Lys Thr Val Phe Asp Glu Ala He Arg Val Val 

165 170 175 

Leu Tyr Pro Ser Ala Arg Ser Asp Asn Lys Arg Ser Lys Gly Arg Ser 

180 185 190 

Cys He Val Ala 
195 

<210> 365 
<211> 576 
<212> DNA 

<213> Paracoccidioides brasiliensis 

<220> 

<221> CDS 

<222> (1) . . (576) 

<400> 365 

atg get gag att cgt cgt aaa ctt gtc att gtt ggt gat ggt gec tgt 48 

Met Ala Glu He Arg Arg Lys Leu Val He Val Gly Asp Gly Ala Cys 

1 5 10 15 

ggt aaa act tgt etc ttg att gtc ttt tec aag ggt acc ttc cct gag 96 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Thr Phe Pro Glu 

20 25. " 30 - 

gtc tac gtc cca acc gtc ttc gag aac tat gtg gec gac gtt gag gtc 144 
Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp Val Glu Val 

35 40 45 

gat gga aag cat gtc gag etc gca ctt tgg gat acg get ggc cag gaa 192 
Asp Gly Lys His Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

gat tac gat cga etc aga cct ctt tec tac cct gat tea cat gtt att 240 
Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser His Val He 
65 70 75 80 

ctg ate tgt ttc get ate gat tec ccc gac tct etc gac aac gtc cag 288 
Leu He Cys Phe Ala He Asp Ser Pro Asp Ser Leu Asp Asn Val Gin 

85 90 95 

gag aag tgg att tct gaa gtc ctt cat ttc tgc cag ggt cat ccc att 336 
Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly His Pro He 

100 105 110 

ate etc gtt ggt tgc aag aaa gat ctt cgt gac gac ccc aga acg att 384 
He Leu Val Gly Cys Lys Lys Asp Leu Arg Asp Asp Pro Arg Thr He 

115 120 ~ 125 

gag gag ctg cgc aag acg tct cag aag ccc gtg acc acc gaa cag ggt 432 
Glu Glu Leu Arg Lys Thr Ser Gin Lys Pro Val Thr Thr Glu Gin Gly 

130 135 140 

gag gag gtc cgc aag aag att ggc get tac aag tat ctg gaa tgc tec 480 
Glu Glu Val Arg Lys Lys He Gly Ala Tyr Lys Tyr Leu Glu Cys Ser 



WO 2005/014828 

145 150 
gcc cga aca aac gac gga 
Ala Arg Thr Asn Asp Gly 
• 165 

get gca ctg ctg gcg aag 
Ala Ala Leu Leu Ala Lys 
180 

taa 



270/291 

155 

gtt cgt gag gtg ttc gag 
Val Arg Glu Val Phe Glu 
170 

aag gag aaa aag aaa tgc 
Lys Glu Lys Lys Lys Cys 
185 
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160 

tea get act cga 528 
Ser Ala Thr Arg 
175 

aag ate ttg 573 
Lys He Leu 
190 

576 



<210> 366 
<211> 191 
<212> PRT 

<213> Paracoccidioides brasiliensis 



<400> 366 

Met Ala Glu He Arg Arg Lys Leu Val He Val Gly Asp Gly Ala Cys 
1 5 10 15 

Gly Lys Thr Cys Leu Leu He Val Phe Ser Lys Gly Thr Phe Pro Glu 

20 25 30 

Val Tyr Val Pro Thr Val Phe Glu Asn Tyr Val Ala Asp Val Glu Val 

35 40 45 

Asp Gly Lys His Val Glu Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu 

50 55 60 

Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro Asp Ser His Val He 
65 ~ ^ 70 75 80 

Leu He Cys Phe Ala He Asp Ser Pro Asp Ser Leu Asp Asn Val Gin 

85 90 95 • 

Glu Lys Trp He Ser Glu Val Leu His Phe Cys Gin Gly His Pro He 

100 105 HO 

He Leu .Val Gly Cys Lys Lys Asp Leu Arg Asp Asp Pro Arg Thr He 

115 120 125 

Glu Glu Leu Arg Lys Thr Ser Gin Lys Pro Val Thr Thr Glu Gin Gly 

130 135 140 

Glu Glu Val Arg Lys Lys He Gly Ala Tyr Lys Tyr Leu Glu Cys Ser 
145 150 155 160 

Ala Arg Thr Asn Asp Gly Val Arg Glu Val Phe Glu Ser Ala Thr Arg 

165 ^ 170 175 

Ala Ala Leu Leu Ala Lys Lys Glu Lys Lys Lys Cys Lys He Leu 
180 185 190 

<210> 367 
<211> 591 
<212> DNA 

<213> Fucus distichus 

<220> 
<221> CDS 
<222> (1) . . (591) 

<400> 367 

atg cag aac ata aag tgc gtc gtc gtg ggt gac ggt gcg gtt ggt aaa 
Met Gin Asn He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
1 5 10 15 

ace tgc etc etc ate tec tac acg acg aac gcg ttt cct ggg gaa tac 96 
Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 3 0 

ate cca act gtc ttc gac aac tat age get aat gta atg gtc gac gga 144 
He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 40 45 

aag ccg ate aac ctg ggc ctt tgg gat act gca ggg cag gaa gat tac 192 
Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gat cga etc egg cca ctg age tac ccg caa acg gac gta ttt ctt gtg 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 



48 



240 
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tgc 


ttc 


age 


gtc 


gtt 


gac 


ccc 


ace age 


ttt 


cac aac gtg aag etc aag 


288 


Cys 


Phe 


Ser 


Val 


Val 


Asp 


Pro 


Thr Ser 


Phe 


His Asn Val Lys Leu Lys 












85 








90 


95 




tgg 


ata 


ccc 


gag 


ctg 


caa 


cac 


cat get 


ccg 


ggc ate cca ttt ata ctg 


336 


Trp 


lie 


Pro 


Glu 


Leu 


Gin 


His 


His Ala 


Pro 


Gly lie Pro Phe He Leu 










100 








105 




110 




artt 


ggt 


acg 


aag 


etc 


gac 


ctg 


agg gat 


gat 


caa gac gcg ate aag cgt 


384 


Val 


Gly 


Thr Lys 


Leu 


Asp 


Leu Arg Asp Asp 


Gin Asp Ala He Lys Arg 








115 










120 




125 




eta 


gca 


gaa 


cgt 


egg 


cag 


acg 


ccc ate 


age 


ttc age gag gcg cag ggt 


432 


Leu 


Ala 


Glu Arg 


Arg 


Gin 


Thr 


Pro lie 


Ser 


Phe Ser Glu Ala Gin Gly 






130 










135 






140 




ctg 


teg 


tct 


gac 


ctt 


gaa 


get 


tac cgc 


tac 


ctt gag tgc age gcg ctg 


480 


Leu 


Ser 


Ser Asp 


Leu 


Glu 


Ala 


Tyr Arg 


Tyr 


Leu Glu Cys Ser Ala Leu 




145 










150 








155 160 




acg 


caa 


cac 


ggg 


tta 


aaa 


cag 


gtg ttt 


gac 


ggg get ate egg tgt gtt 


528 


Thr 


Gin 


His 


Gly 


Leu 


Lys 


Gin Val Phe Asp Gly Ala He Arg Cys Val 












165 








170 


175 




eta 


gaa 


cag 


aac 


cag 


aga 


aag 


atg aaa 


aag 


aaa aag ggc aag aaa ggc 


576 


Leu 


Glu 


Gin 


Asn 


Gin Arg Lys Met Lys Lys Lys Lys Gly Lys Lys Gly 










180 








185 




190 




tgc 


gtc 


ate 


tct 


tga 












591 


Cys 


Val 


He 


Ser 


















195 



















<210> 368 
<211> 196 
<212> PRT . 

<213> Pucus distichus 
<400> 368 

Met Gin Asn He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

lie Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 

35 . * 40 45 

Lys Pro He Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
65 70 75 80 

Cys Phe Ser Val Val Asp Pro Thr Ser Phe His Asn Val Lys Leu Lys 

85 90 95 

Trp He Pro Glu Leu Gin His His Ala Pro Gly He Pro Phe He Leu 

100 105 110 

Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Gin Asp Ala He Lys Arg 

115 120. 125 

Leu Ala Glu Arg Arg Gin Thr Pro He Ser Phe Ser Glu Ala Gin Gly 

130 135 140 

Leu Ser Ser Asp Leu Glu Ala Tyr Arg Tyr Leu Glu Cys Ser Ala Leu 
145 150 155 ~ 160 

Thr Gin His Gly Leu Lys Gin Val Phe Asp Gly Ala He Arg Cys Val 

165 170 175 

Leu Glu Gin Asn Gin Arg Lys Met Lys Lys Lys Lys Gly Lys Lys Gly 

180 185 190 

Cys Val He Ser 
195 

<210> 369 
<211> 636 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> CDS 

<222> (1) . . (636) 
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240 



288 



<400> 369 

atg cag gcc ate aag tgt gtg gtg gtg gga gac gga get gta ggt aaa 48. 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

act tgc eta ctg ate agt tac aca ace aat gca ttt cct gga gaa tat 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 

20 25 30 

ate cct act gtc ttt gac aat tat tct gcc aat gtt atg gta gat gga 144 
He Pro Thr Val Phe Asp Asn Tyr Ser Ala. Asn Val Met Val Asp Gly 

35 40 45 

aaa ccg gtg aat ctg ggc tta tgg gat aca get gga caa gaa gat tat 192 
Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gac aga tta cgc ccc eta tec tat ccg caa aca gtt gga gaa acg tac 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Val Gly Glu Thr Tyr 
65 " 70 75 80 

ggt aag gat ata acq tec egg ggc aaa gac aag ccg att gcc. gat gtg 
Gly Lys Asp He Thr Ser Arg Gly Lys Asp Lys Pro He Ala Asp Val 

85 90 95 

ttc tta att tgc ttt tec ctt gtg agt cct gca tea ttt gaa aat gtc 336 
Phe Leu He Cys Phe Ser Leu Val Ser Pro Ala Ser Phe Glu Asn Val 

100 105 HO 

cgt gca aag tgg tat cct gag gtg egg cac cac tgt ccc aac act ccc 384 
Arg Ala Lys Trp Tyr Pro Glu Val Arg His His Cys Pro Asn Thr Pro 

115 120 125 

ate ate eta gtg gga act aaa ctt gat ctt agg gat gat aaa gac acg 432 
He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys Asp Thr 

130 135 140 

ate gag aaa ctg aag gag aag aag ctg act ccc ate acc tat ccg cag 
He Glu Lys Leu Lys Glu Lys Lys Leu Thr Pro He Thr Tyr Pro Gin 
145 150 155 160 

ggt eta gcc atg get aag gag att ggt get gta aaa tac ctg gag tgc 
Gly Leu Ala Met Ala Lys Glu He Gly Ala Val Lys Tyr Leu Glu Cys 

165 170 175 

teg gcg etc aca cag cga ggc etc aag aca gtg ttt gac gaa gcg ate 576 
Ser Ala Leu Thr Gin Arg Gly Leu Lys Thr Val Phe Asp Glu Ala He 

180 ~ 185 190 

cga gca gtc etc tgc ccg cct ccc gtg aag aag agg aag aga aaa tgc 624 
Arg Ala Val Leu Cys Pro Pro Pro Val Lys Lys Arg Lys Arg Lys Cys 

195 ~ 200 205 

ctg ctg ttg taa 636 
Leu Leu Leu 

210 



480 



528 



<210> 370 
<211> 211 
<212> PRT 
<213> Homo sapiens 



<400> 370 



Met 


Gin 


Ala 


He 


Lys Cys Val Val 


1 








5 


Thr 


Cys 


Leu 


Leu 


lie Ser Tyr Thr 








20 




He 


Pro 


Thr 


Val 


Phe Asp Asn Tyr 






35 




40 


Lys 


Pro 


Val 


Asn 


Leu Gly Leu Trp 


50 






55 


Asp 


Arg 


Leu 


Arg 


Pro Leu Ser Tyr 


65 








70 


Gly 


Lys 


Asp 


He 


Thr Ser Arg Gly 










85 


Phe 


Leu 


He 


Cys 


Phe Ser Leu Val 








100 




Arg 


Ala 


Lys 


Trp 


Tyr Pro Glu Val 






115 




120 


He 


He 


Leu 


Val 


Gly Thr Lys Leu 



Val Gly Asp Gly Ala Val Gly Lys 

10 ~ 15 
Thr Asn Ala Phe Pro Gly Glu Tyr 
25 30 
Ser Ala Asn Val Met Val Asp Gly 
45 

Asp Thr Ala Gly Gin Glu Asp Tyr 
60 

Pro Gin Thr Val Gly Glu Thr Tyr 

75 80 
Lys Asp Lys Pro He Ala Asp Val 

90 95 
Ser Pro Ala Ser Phe Glu Asn Val 
105 110 
Arg His His Cys Pro Asn Thr Pro 
125 

Asp Leu Arg Asp Asp Lys Asp Thr 
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130 135 140 

lie Glu Lys Leu Lys Glu Lys Lys Leu Thr Pro He Thr Tyr Pro Gin 
145 150 155 "* 160 

Gly Leu Ala Met Ala Lys Glu He Gly Ala Val Lys Tyr Leu Glu Cys 

165 170 175 

Ser Ala Leu Thr Gin Arg Gly Leu Lys- Thr Val Phe Asp Glu Ala He . 

I 80 185 190 

Arg Ala Val Leu Cys Pro Pro Pro Val Lys Lys Arg Lys Arg Lys Cys 

195 200 205 

Leu Leu Leu 
210 

<210> 371 
<211> 576 
<212> DNA 

<213> Pneumocystis carinii 

<220> 

<221> CDS 

<222> (1) . . (576) 

<400> 371 

atg caa a!ct att aaa tgt gtc gtt gtt ggg gat ggc gcg gtt ggt aaa 48 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

acg tgt ctt tta ata tec tat acg aca aac aaa ttt cct tea gaa tat 96 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

gtt cct act gta ttt gat aat tat gcg gtt acc gta atg att gga gaa 144 
Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Glu 

35 40 45 

gaa cct tat act tta gga ctt ttt gat aca gca ggt caa gaa gat tat 192 
Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

gac aga ttg cgt ccc tta tea tat cca caa acg gat gtt ttt ctt att 240 
Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 70 75 80 

tgc ttt tct gtt act agt cca gca agt ttt gaa aat gta aga gaa aag 2 88 

Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Arg Glu Lys 

85 90 95 

tgg cat cca gag gtt cgt cat cat tgt cca gga aca ccc tgt ctt att 336 
Trp His Pro Glu Val Arg His His Cys Pro Gly Thr Pro Cys Leu He 

100 105 no 

gtt ggt aca caa ate gat tta cga gat gat cct atg att gta gag aaa 384 
Val Gly Thr Gin He Asp Leu Arg Asp Asp Pro Met He Val Glu Lys 

115 120 125 

etc agt cga caa aga caa acc ccc att aca aaa gaa tta ggt gaa aaa 432 
Leu Ser Arg Gin Arg Gin Thr Pro He Thr Lys Glu Leu Gly Glu Lys 

130 135 140 

ctt tea aaa gaa ttg ggt get gta aaa tat gtt gag tgc tea get ttg 480 
Leu Ser Lys Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 150 155 160 

act caa aaa gga tta aaa aac gtt ttt gat gaa get ata gtt tgc gca 528 
Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Cys Ala 

165 170 175 

ctt gaa cca ccc gtt acg aag aag aaa act aaa tgt ctt att tta 573 
Leu Glu Pro Pro Val Thr Lys Lys Lys Thr Lys Cys Leu He Leu 

180 185 ~ 190 

taa 576 



<210> 372 
<211> 191 
<212> PRT 

<213> Pneumocystis carinii 
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<400> 372 

Met Gin Thr He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 

1 5 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 

20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Glu 

35 40 - 45 

Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 

50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
65 " 70 75 80 

Cys Phe Ser Val Thr Ser Pro Ala Ser Phe Glu Asn Val Arg Glu Lys 

85 90 95 

Trp His Pro Glu Val Arg His His Cys Pro Gly Thr Pro Cys Leu He 

100 105 HO 

Val Gly Thr Gin He Asp Leu Arg Asp Asp Pro Met He Val Glu Lys 

115 ' 120 . . 125 

Leu Ser Arg Gin Arg Gin Thr Pro He Thr Lys Glu Leu Gly Glu Lys 

130 135 140 

Leu Ser Lys Glu Leu Gly Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
145 ~ 150 155 160 

Thr Gin Lys Gly Leu Lys Asn Val Phe Asp Glu Ala He Val Cys Ala 

165 170 175 

Leu Glu Pro Pro Val Thr Lys Lys Lys Thr Lys Cys Leu He Leu 
180 185 190 

<210> 373 
<211> 630 
<212> DNA 

<213> Ashbya gossypii 
<220> 

<221> CDS 1 
<222> (1) . . (630) 

<400> 373 

atg gcg tac cag aca ggc ggc aac ata cgc aag aag ttg gtt att gtc 48 

Met Ala Tyr Gin Thr Gly Gly Asn He Arg Lys Lys Leu Val He Val 

1 5 10 15 

ggc gac ggg gcg tgt ggg aag acg tgt ctg etc gtt gtg ttc tec aaa 96 

Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu Val Val Phe Ser Lys 

20 25 30 

ggg cag ttc ccg gag ate cat gtg ccc acg gtg ttc gag aac tac gtg 144 
Gly Gin Phe Pro Glu He His Val Pro Thr Val Phe Glu Asn Tyr Val 

35 40 45 

gca gat gtg gac ate gac ggg cga cgc gta gag ctg gca ctg tgg gat 192 
Ala Asp Val Asp He Asp Gly Arg Arg Val Glu Leu Ala Leu Trp Asp 

50 55 60 

aca gcg ggc cag gag gac tac gac egg ctg egg cca ttg teg tac ccg 
Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro 
65 70 75 80 

gat tec aat gtg gtg ctt ate tgt ttc tct gtg gac eta cct gac teg 288 
Asp Ser Asn Val Val Leu He Cys Phe Ser Val Asp Leu Pro Asp Ser 

85 90 95 

ctg gac aat gtg cag gag aag tgg gtc age gaa gtt ctg cac ttc tgc 33 6 

Leu Asp Asn Val Gin Glu Lys Trp Val Ser Glu Val Leu His Phe Cys 

100 105 HO 

cag ggc gtg cca att tta ctg gta ggc tgc aag gtg gat ctg aga aac 384 
Gin Gly Val Pro He Leu Leu Val Gly Cys Lys Val Asp Leu Arg Asn 

115 120 125 

gac cct cag gtg etc cag cag ctg cag gee gag ggc cag cgc ccc gtg 432 
Asp Pro Gin Val Leu Gin Gin Leu Gin Ala Glu Gly Gin Arg Pro Val 

130 135 140 

ace gee gca gag ggc tea gee gtc gee ggt aag ata ggc gec gtc gec 
Thr Ala Ala Glu Gly Ser Ala Val Ala Gly Lys He Gly Ala Val Ala 
145 150 155 160 

tac etc gag tgc tct gcg cgc aca ggc cag ggt gtg aag gag gtt ttc 528 
Tyr Leu Glu Cys Ser Ala Arg Thr Gly Gin Gly Val Lys Glu Val Phe 



240 



480 
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165 170 175 

gac act gcg acc cgc gcc gca etc tct ggc aag ccc gec gcg tct gca 576 
Asp Thr Ala Thr Arg Ala Ala Leu Ser Gly Lys Pro Ala Ala Ser Ala 

180 185 190 

ggc aag aag aag gtc cac ggt gac aag aag aag aag aaa tgt ctg gtc 624 
Gly Lys Lys Lys Val His Gly Asp Lys Lys Lys Lys Lys Cys Leu Val 

195 200 205 

ctg tga 630 
Leu 



<210> 374 
<211> 209 
<212> PRT 

<213> Ashbya gossypii 

<400> 374 

Met Ala Tyr Gin Thr Gly Gly Asn lie Arg Lys Lys Leu Val lie Val 

1 5 10 15 

Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu Val Val Phe Ser Lys 

20 25 30 

Gly Gin Phe Pro Glu lie His Val Pro Thr Val Phe Glu Asn Tyr Val 

35 .40 45 

Ala Asp Val Asp lie Asp Gly Arg Arg Val Glu Leu Ala Leu Trp Asp 

50 55 60 

Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr Pro 
65 70 75 80 

Asp Ser Asn Val Val Leu lie Cys Phe Ser Val Asp Leu Pro Asp Ser 

85 90 ^ 95 

Leu Asp Asn Val Gin Glu Lys Trp Val Ser Glu Val Leu His Phe Cys 

100 105 no 

Gin Gly Val Pro lie Leu Leu Val Gly Cys Lys Val Asp Leu Arg Asn 

115 120 * 125 

Asp Pro Gin Val Leu Gin Gin Leu Gin Ala Glu Gly Gin Arg Pro Val 

130 135 140 

Thr Ala Ala Glu Gly Ser Ala Val Ala Gly Lys lie Gly Ala Val Ala 
145 150 155 160 

Tyr Leu Glu Cys Ser Ala Arg Thr Gly Gin Gly Val Lys Glu Val Phe 

165 170 " ~ 175 

Asp Thr Ala Thr Arg Ala Ala Leu Ser Gly Lys Pro Ala Ala Ser Ala 

180 185 190 

Gly Lys Lys Lys Val His Gly Asp Lys Lys Lys Lys Lys Cys Leu Val 
195 200 205 

Leu 

<210> 375 
<211> 564 
<212> DNA 

<213> Ashbya gossypii 

<220> 

<221> CDS 

<222> (1) . . (564) 

<400> 375 

atg acg gtc aac gtt gtg aga egg aag ttg gta ate ata ggg gat ggg 48 

Met Thr Val Asn Val Val Arg Arg Lys Leu Val He He Gly Asd Glv 

1 5 10 15 

gca tgc ggc aag acg teg tta eta cat gtg ttc acg ctg ggg aag ttc 96 

Ala Cys Gly Lys Thr Ser Leu Leu His Val Phe Thr Leu Gly Lys Phe 

20 25 30 

cct gag gaa tat ctg ccc acg gtt ttc gag aac tac gtt acg gat tgc 144 
Pro Glu Glu Tyr Leu Pro Thr Val Phe Glu Asn Tyr Val Thr Asp Cys 

35 40 45 

cgt gta gac ggc ata aaa gtg cag ttg gcg eta tgg gat act get ggt 192 
Arg Val Asp Gly He Lys Val Gin Leu Ala Leu Trp Asp Thr Ala Gly 
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50 










cag 


gaa 


gaa 


tac 


gag 


cgt 


Gin 


Glu 


Glu 


Tyr 


Glu 


Arg 


65 










70 


ate 


ata 


tta 


ate 


ggg 


ttt 


He 


He 


Leu 


He 


Gly 


Phe 










85 




gcg 


egg 


gag 


aag 


tgg 


acg 


Ala 


Arg Glu 


Lys 


Trp 


Thr 








100 






ccg 


ate 


ate 


etc 


gtg 


ggg 


Pro 


He 


He 


Leu 


Val 


Gly 






115 








cag 


tgc 


gcg 


atg 


gta 


gcg 


Gin 


Cys 


Ala 


Met 


Val 


Ala 




130 










ate 


ggc 


gca 


aag 


aaa 


tac 


He 


Gly Ala 


Lys 


Lys" 


Tyr 


145 










150 


gtg 


gac gat 


gtg 


ttc 


gag 


Val 


Asp Asp 


Val 


Phe 


Glu 










165 




aag 


gag 


ccg 


ggt 


caa 


ggc 


Lys 


Glu 


Pro 


Gly 


Gin 


Gly 






180 







276/291 

55 60 
ctg cgc ccc atg tec tac 
Leu Arg Pro Met Ser Tyr 
75 

gec ata gac gac ccg ggg 
Ala He Asp Asp Pro Gly 

. 90 
gtc gag gcg ctg cgc tac 
Val Glu Ala Leu Arg Tyr 

105 

etc aaa aag gac ctt cgc 
Leu Lys Lys Asp Leu Arg 
120 

cct teg cag gca caa gag 
Pro Ser Gin Ala Gin Glu 
135 140 
atg gag tgc age gca ctt 
Met Glu Cys Ser Aia Leu 
155 

ctg gec acg aga aca agt 
Leu Ala Thr Arg Thr Ser 
170 

tgt tgc att ate tea tga 
Cys Cys He He Ser 
185 
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teg aag gcg gac 240 
Ser Lys Ala Asp 
80 

teg ttg tec aat 288 
Ser Leu Ser Asn 
95 

tgt ccc aac gee 336 
Cys Pro Asn Ala 
110 

cgc ccc ggg acg 384 

Arg Pro Gly Thr 

125 

gtg gtg cgc gee 432 
Val Val Arg Ala 

acg ggg gag ggc 480 
Thr Gly Glu Gly 
160 

ctt ctg gtg aac 528 
Leu Leu Val Asn 
175 

564 



<210> 376 
<211> 187 
<212> PRT 

<213> Ashbya gossypii 
<400> 376 

Met Thr Val Asn Val Val Arg Arg Lys Leu Val He He Gly Asp Gly 

15 10 15 

Ala Cys Gly Lys Thr Ser Leu Leu His Val Phe Thr Leu Gly Lys Phe 

20 25 30 

Pro Glu Glu Tyr Leu Pro Thr Val Phe Glu Asn Tyr Val Thr Asp Cys 

35 40 45 

Arg Val Asp Gly He Lys Val Gin Leu Ala Leu Trp Asp Thr Ala Gly 

50 55 60 

Gin Glu Glu Tyr Glu Arg Leu Arg Pro Met Ser Tyr Ser Lys Ala Asp 
65 70 75 80 

He He Leu He Gly Phe Ala He Asp Asp Pro Gly Ser Leu Ser Asn 

85 90 95 

Ala Arg Glu Lys Trp Thr Val Glu Ala Leu Arg Tyr Cys Pro Asn Ala 

100 105 HO 

Pro He He Leu Val Gly Leu Lys Lys Asp Leu Arg Arg Pro Gly Thr 

115 120 125 

Gin Cys Ala Met Val Ala Pro Ser Gin Ala Gin Glu Val Val Arg Ala 

130 135 140 

He Gly Ala Lys Lys Tyr Met Glu Cys Ser Ala Leu Thr Gly Glu Gly 
145 150 155 160 

Val Asp Asp Val Phe Glu Leu Ala Thr Arg Thr Ser Leu Leu Val Asn 

165 170 175 

Lys Glu Pro Gly Gin Gly Cys Cys He He Ser 
180 185 



<210> 377 
<211> 1043 
<212> DNA 

<213> Tigriopus japonicus 

<220> 

<221> CDS 

<222> (264).. (842) 



<400> 377 
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aggaggagac gccttgtttg aaatccatcc aacaacctgg gactctctgc tccagcagca 60 

agcatcacca cccccatcac catcatcttc atcctgatca acacccttga gatccagatc 120 

cccatccatc gatccaccca tcctgaggat tgactcacac acctcgaaga ttgtgggctg 180 

tgaaaaacca gtgtgggatc ggcccttgcg tgcgtgaggt gggtggttgt tgaaacccga 240 



ctggcctgac cattgtcgtc teg atg gcg get att cgc aag aaa tta gtg att 293 

Met Ala Ala lie Arg Lys Lys Leu Val lie 
1 5 10 



gtg 


ggc 


gat 


ggc 


gec 


tgc 


ggc 


aag 


acc 


tgc 


etc 


ttg 


ate 


gtg 


ttc 


tec 


341 


Val 


Gly 


Asp 




Ala 


Cys 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 


Val 


Phe 


Ser 












is 










20 










25 






aag 


gat 


caa 


ttc 


ccc 


gag 


gtg 


tat 


gtg 


ccc 


aca 


gtg 


ttc 


gaa 


aac 


tac 


389 


Lys 


Asp 


Gin 


Phe 


Pro 


Glu 


Val 


Tyr 


Val 


Pro 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 










30 










35 










40 








gtg 


get 


gac 


ate 


gaa 


gtg 


gac 


ggc 


aag 


caa 


gtg 


gaa 


ctg 


gee 


ttg 


tgg 


437 


Val 


Ala 


Asp 


lie 


Glu 


Val 


Asp 


Gly 


Lys 


Gin 


Val 


Glu 


Leu 


Ala 


Leu 


Trp 








45 










50 










55 








gat 


acg 


gee 


ggt 


cag 


gag 


gac 


tac 


gat 


cga 


etc 


cga 


ccc 


ctg 


tec 


tat 


485 


Asp 


Thr 


Ala 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asp 


Arg 


Leu 


Arg 


Pro 


Leu 


Ser 


Tyr 






60 










65 










70 












ccg 


gat 


acg 


gat 


gtg 


ate 


ctg 


atg 


tgt 


ttc 


teg 


ate 


gac 


teg 


ccc 


gat 


533 


Pro 


Asp 


Thr 


Asp 


Val 


lie 


Leu 


Met 


Cys 


Phe 


Ser 


He 


Asp 


Ser 


Pro 


Asp 




75 










80 










85 










90 




teg 


etc 


gag 


aac 


ate 


cct 


gag 


aag 


tgg 


acg 


ccc 


gaa 


gtc 


aaa 


cac 


ttc 


581 


Ser 


Leu 


Glu 


Asn 


lie 


Pro 


Glu 


Lys 


Trp 


Thr 


Pro 


Glu 


Val 


Lys 


His 


Phe 












95 










100 










105 






tgc 


cct 


aat 


gtg 


ccc 


att 


ate 


etc 


gtg 


gga 


aac 


aaa 


aaa 


gac 


ttg 


cgc 


629 


Cys 


Pro 


Asn 


Val 


Pro 


He 


He 


Leu 


Val 


Gly 


Asn 


Lys 


Lys 


Asp 


Leu 


Arg 










110 










115 










120 








aat 


gat 


ccc 


aat 


acc 


ate 


aag 


gag 


ttg 


ggc 


aag 


atg 


aaa 


caa 


gaa 


ccg 


677 


Asrx 


Asp 


Pro 


Asn 


Thr 


He 


Lys 


Glu 


Leu 


Gly 


Lys 


Met 


Lys 


Gin 


Glu 


Pro 








125 










130 










135 










gtc 


aag 


ccg 


gaa 


gag 


ggt 


cgc 


aca 


atg 


gee 


gaa 


aag 


ate 


aac 


gec 


ttc 


725 


Val 


Lys 


Pro 


Glu 


Glu 


Gly 


Arg 


Thr 


Met 


Ala 


Glu 


Lys 


He 


Asn 


Ala 


Phe 






140 










145 










150 












gee 


tac 


etc 


gag 


tgt 


teg 


gee 


aag 


age 


aag 


gaa 


ggc 


gtc 


aga 


gaa 


gtc 


773 


Ala 


Tyr 


Leu 


Glu 


Cys 


Ser 


Ala 


Lys 


Ser 


Lys 


Glu 


Gly 


Val 


Arg 


Glu 


Val 




155 










160 










165 










170 




ttt 


gag 


aca 


gee 


acc 


cga 


gcg 


gca 


tta 


cag 


gtc 


aag 


aag 


aaa 


aag 


aag 


821 


Phe 


Glu 


Thr 


Ala 


Thr 


Arg 


Ala 


Ala 


Leu 


Gin 


Val 


Lys 


Lys 


Lys 


Lys 


Lys 












175 










180 










185 






aag 


cct 


tgc 


gtt 


etc 


ttt 


taatgtcgga aaccaaacca tttttcctat 




869 


Lys 


Pro 


Cys 


Val 


Leu 


Phe 

























190 

tcttattacc tatcttcatc atcaacatca acatccttca accacaaata caactacaac 929 



tattactata cttctactac tatggacgcg atcttaattt agtegcaatt atcaattggg 989. 



gaaatttacg gaataaaaca gaecctgaga acaaaaaaaa aaaaaaaaaa aaaa 1043 



<210> 378 
<211> 192 
<212> PRT 

<213 > Tigriopus j aponicus 



<400> 378 
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Met 


Ala 


Ala 


He 


Arq 


Lys 


Lys 


Leu Val 


He 


Val Gly 


Asp 


Gly 


Ala 


Cys 


1 








5 








10 








15 




Glv 


Lys 


Thr 


Cvs 


Leu 


Leu 


He 


Val Phe 


Ser 


Lys Asp 


Gin 


Phe 


Pro 


Glu 






20 








25 








30 






val 


Tvr 
±yj. 


Val 


Pro 


Thr 


Val 


Phe 


Glu Asn 


Tyr 


Val Ala 


Asp 


He 


Glu 


Val 




35 










40 






45 








Asp 


Gly 


Lys 


Gin 


Val 


Glu 


Leu 


Ala Leu 


Trp 


Asp Thr 


Ala 


Gly 


Gin 


Glu 




50 








55 






60 










Asp 


± y i - 


Asp 




Leu 


Arcr 


Pro 


Leu S er 


Tvr 


Pro Asp 


Thr 


ASP 


Val 


He 


65 






70 








75 








80 


•UCU 


Met 


Cys 


Phe 


Ser 


He 


Asd 


Ser Pro 


Aso 


Ser Leu 


Glu 


Asn 


He 


Pro 








85 








90 








95 




Glu 


Lys 




Thr 


Pro 


Glu 


Val 


Lvs His 


Phe 


Cys Pro 


Asn 


Val 


Pro 


He 






100 








105 








110 






He 


Leu 


Val 
115 


Gly 


Asn 


Lys 


Lys 


Asd lieu 
120 


Arcr 


Asn Asp 


Pro 
125 


Asn 


Thr 


He 


J-Jjf 2D 


ux tx 


lieu 


Glv 


Lys 


Met 


Lys 


Gin Glu 


Pro 


Val Lys 


_Pro 


Glu 


Glu 


Gly 




130 










135 






140 










Arg 


Thr 


Met 


Ala 


Glu 


Lys 


He 


Asn Ala 


Phe 


Ala Tyr 


Leu 


Glu 


Cys 


Ser 


145 










150 








155 








160 


Ala 


Lys 


Ser 


Lys 


Glu 


Gly 


Val 


Arg Glu 


Val. 


Phe Glu 


Thr 


Ala 


Thr 


Arg 








165 








170 








175 




Ala 


Ala 


Leu 


Gin 
180 


Val 


Lys 


Lys 


Lys Lys 
185 


Lys 


Lys Pro 


Cys 


Val 
190 


Leu 


Phe 



<210> 379 

<211> 1014 

<212> DNA 

<213> Rhopalosiphum pacli 

<220> 

<221> CDS 

<222> (197) . . (772) 

<400> 379 

tcaattgcgt tctgccattt ttttctaccg acgtcgatcc gcgcgtgttg atctgccggg 60 



tacactccgt acacaggttg gagatcgctg caggcagcgt tcgattgcga caacgacgac 120 



cagcgagaac acttctctag tgctctgtgc tgcttattca ccagtgtaca cgtaggagtg 180 



ttcccatagc ggcaac atg cag acc ate 
Met Gin Thr He 











1 








5 


get 


gtc 


ggt 


aag 


act 


tgt 


ctg 


etc 


ata 


Ala 


Val 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


He 






15 










20 




cct 


tea 


gaa 


tat 


gta 


ccg 


act 


gtt 


ttc 


Pro 


Ser 


Glu 


Tyr 


Val 


Pro 


Thr 


val 


Phe 




30 










35 






atg 


att 


ggt 


ggg 


gaa 


cca 


tac 


aca 


tta 


Met 


He 


Gly 


Gly 


Glu 


Pro 


Tyr 


Thr 


Leu 


45 










50 








cag 


gaa 


gat 


tat 


gat 


cgc 


etc 


aga 


cct 


Gin 


Glu 


Asp 


Tyr 


Asp 

65 


Arg 


Leu 


Arg 


Pro 


gtg 


ttt 


ctt 


gtt 


tgt 


ttc 


tct 


gtg 


gtt 


Val 


Phe 


Leu 


val 


Cys 


Phe 


Ser 


val 


Val 








80 








85 


gtc 


aaa 


gaa 


aaa 


tgg 


gtt 


ccg 


gag 


ata 


Val 


Lys 


Glu 


Lys 


Trp 


Val 


Pro 


Glu 


He 






95 










100 




cca 


ttc 


ctg 


ttg 


gtt 


ggc 


aca 


caa 


ata 


Pro 


Phe 


Leu 


Leu 


Val 


Gly 


Thr 


Gin 


He 




110 










115 







tgc gtt gtt gtt ggt gat gga 232 
Cys Val Val Val Gly Asp Gly 
10 



teg 


tac 


aca 


aca 


aac 


aaa 


ttt 


280 


Ser 


Tyr 


Thr 


Thr 
25 


Asn 


Lys 


Phe 




gac 


aat 


tat 


gca 


gtg 


aca 


gtt 


328 


Asp 


Asn 


Tyr 
40 


Ala 


Val 


Thr 


Val 




ggt 


tta 


ttt 


gat 


aca 


gca 


ggt 


376 


Gly 


Leu 
55 


Phe 


Asp 


Thr 


Ala 


Gly 
60 




ttg 


agt 


tat 


cca 


caa 


act 


gat 


424 


Leu 


Ser 


Tyr 


Pro 


Gin 


Thr 


Asp 




70 










75 






tta 


cca 


tct 


tea 


ttt 


gaa 


aat 


472 


Leu 


Pro 


Ser 


Ser 


Phe 
90 


Glu 


Asn 




acg 


cat 


cac 


tgt 


caa 


aaa 


aca 


520 


Thr 


His 


His 


Cys 
105 


Gin 


Lys 


Thr 




gac 


ctt 


aga 


gaa 


gat 


gee 


aca 


568 


Asp 


Leu 


Arg 
120 


Glu 


Asp 


Ala 


Thr 
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act 


gtg 


gag 


aaa 


eta 


gec 


aaa 


aat 


aaa 


caa 


aaa 


tea 


ata tea 


ttt 


gaa 


616 


Thr 


Val 


Glu 


Lys 


Leu 


Ala 


Lys 


Asn 


Lys 


.Gin 


Lys 


Ser 


He Ser 


Phe 


Glu 




125 










13 0 










135 








140 




caa 


gga 


gag 


aag 


eta 


get 


aaa 


gaa 


ctt 


aaa 


get 


gtg 


aaa tat 


gtc 


gaa 


664 


Gin 


Gly 


Glu 


Lys 


Leu 


Ala 


Lys 


Glu 


Leu 


Lys 


Ala 


Val 


Lys Tyr 


Val 


Glu 












145 










150 






155 






tgc 


tea 


gca 


ctt 


aca 


caa 


aaa 


gga 


eta 


aaa 


aat 


-gta 


ttt gat 


gaa get 


712 


Cys 


Ser 


Ala 


Leu 


Thr 


Gin 


Lys 


Gly Leu 


Lys 


Asn 


Val 


Phe Asp 


Glu Ala 










160 










165 








170 








att 


ctt 


gca 


get 


tta 


gag 


cct 


cct 


gaa 


cca 


gtt 


aag 


aag agg 


aag 


tgt 


760 


lie 


Leu 


Ala 


Ala 


Leu 


Glu 


Pro 


Pro 


Glu 


Pro 


Val 


Lys 


Lys Arg 


Lys 


Cys 








175 










180 










185 





gtt ata ttg taaggctgcg gataaataaa caggtgegae aattatgtca taaaaatatt 819 
Val He Leu 



190 

taagataaaa caatttaaat catgatttag catggataca ataatgaaat aataattatt 879 

ttgtttttac taatctataa atatatatat ataaataatt tattttatat tttacaagaa 939 

aatatgtgca ttcattgaat aataaataaa taggtttttt atacgecaaa aaaaaaaaaa 999 

aaaaaaaaaa aaaaa 1014 



<210> 380 
<211> 191 
<212> PRT 

<213> Rhopalosiphum padi 
<400> 380 



Met 


Gin 


Thr 


He 


Lys Cys 


Val 


Val 


1 








5 






Thr 


Cys 


Leu 


Leu 


He Ser 


Tyr 


Thr 








20 








Val 


Pro 


Thr 


Val 


Phe Asp 


Asn 


Tyr 






35 








40 


Glu 


Pro 


Tyr 


Thr 


Leu Gly 


Leu 


Phe 




50 








55 




Asp 


Arg 


Leu 


Arg 


Pro Leu 


Ser 


Tyr 


65 








70 






Cys 


Phe 


Ser 


Val 


Val Leu 


Pro 


Ser 










85 






Trp 


Val 


Pro 


Glu 


He Thr 


His 


His 








100 








Val 


Gly 


Thr 


Gin 


He Asp 


Leu 


Arg 






115 








120 


Leu 


Ala 


Lys 


Asn 


Lys Gin 


Lys 


Ser 




130 








135 




Leu 


Ala 


Lys 


Glu 


Leu Lys 


Ala 


Val 


145 








150 






Thr 


Gin 


Lys 


Gly 


Leu Lys 


Asn 


Val 










165 






Leu 


Glu 


Pro 


Pro 


Glu Pro 


Val 


Lys 



180 

<210> 381 
<211> 679 
<212> DNA 

<213> Hordeum vulgare var 

<220> 

<221> CDS 

<222> (14) . . (655) 



Val Gly Asp Gly Ala Val Gly Lys 

10 15 
Thr Ash Lys Phe Pro Ser Glu Tyr 
25 30 
Ala Val Thr Val Met He Gly Gly 
45 

Asp Thr Ala Gly Gin Glu Asp Tyr 
60 

Pro Gin Thr Asp Val Phe Leu Val 

75 80 
Ser Phe Glu Asn Val Lys Glu Lys 

90 95 
Cys Gin Lys Thr Pro Phe Leu Leu 
105 110 
Glu Asp Ala Thr Thr Val Glu Lys 
125 

He Ser Phe Glu Gin Gly Glu Lys 
140 

Lys Tyr Val Glu Cys Ser Ala Leu 
155 160 
Phe Asp Glu Ala He Leu Ala Ala 

170 175 
Lys Arg Lys Cys Val He Leu 
185 190 
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<400> 381 

cgcgcggcga gcc atg gcg tec age gec tec egg ttc ate aag tgc gtc 49 
Met Ala Ser Ser Ala Ser Arg Phe lie Lys Cys Val 
15 10 
acg gtg ggc gac ggc gcc gtc ggc aag ace tgc atg etc ate tgc tac 97 
Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys .Met Leu lie Cys Tyr 

15 20 25 

acc age aac aag ttc ccc ace gac tac ata. ccc acg gtg ttc gac aat 145 
Thr Ser Asn Lys Phe Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn 

30 35 40 

ttc age gcg aac gtg gtg gcg gac ggc acc acg gtg aat ttg ggc ctt 193 
Phe Ser Ala Asn Val Val Ala Asp Gly Thr Thr Val Asn Leu Gly Leu 
45 50 55 60 

tgg gac acc gcc ggg cag gag gat tac aac egg ctg agg cct eta age 241 
Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser 

65 70 75 

"tac cgc ggc gcc gac gift ttc gtg ctt gcc ttc tec ctt gtg age cga 289 
Tyr Arg Gly Ala Asp Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg 

80 85 90 

get age tat gag aat ate atg aag aag tgg ata ccg gag ctt cag cat 337 
Ala Ser Tyr Glu Asn He Met Lys Lys Trp He Pro .Glu Leu Gin His 

95 100 105 

tac gcg ccc ggc gta cct gtt gtg ctg gta ggc aca aaa ctg gat ctt 385 
Tyr Ala Pro Gly Val Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu 

110 115 120 

cgt gaa gat aag cac tat ttg ctg gac cac cct ggg atg ata ccc gtt 433 
Arg Glu Asp Lys His Tyr Leu Leu Asp His Pro Gly Met He Pro Val 
125 "* 130 135 140 

acc aca gca cag ggg gag gaa ctt cgt aag caa gtt ggt get tta tat 481 
Thr Thr Ala Gin Gly Glu Glu Leu Arg Lys Gin Val Gly Ala Leu Tyr 

145 150 155 

tac ata gag tgc age tea aag aca caa cag aat gtc aaa get gtg ttt « 529 

Tyr He Glu Cys Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe 

160 165 170 

gat get get ate aag gta gta ate cag ccc cca act aaa caa aga gaa 577 
Asp Ala Ala He Lys Val Val He Gin Pro Pro Thr Lys Gin Arg Glu 

175 180 185 

aag aag aaa aag aaa cag cgt egg gga tgt tct atg atg aac ttc age 625 
Lys Lys Lys Lys Lys Gin Arg Arg Gly Cys Ser Met Met Asn Phe Ser 

190 195 200 

gga agg aaa atg eta tgc ttc aaa tec tgaatgatga aagagaaggt 672 
Gly Arg Lys Met Leu Cys Phe Lys Ser 
205 210 
tecttge 



679 



<210> 382 
<211> 213 
<212> PRT 

<213> Hordeum vulgare var 



<400> 382 



Met Ala 


Ser 


Ser 


Ala 


Ser Arg 


Phe 


1 






5 






Gly Ala 


Val 


Gly 


Lys 


Thr Cys 


Met 






20 








Phe Pro 


Thr 


Asp 


Tyr 


He Pro 


Thr 




35 








40 


Val Val 


Ala 


Asp 


Gly 


Thr Thr 


Val 


50 








55 




Gly Gin 


Glu 


Asp 


Tyr 


Asn Arg Leu 


65 








70 




Asp Val 


Phe 


Val 


Leu 


Ala Phe 


Ser 






85 






Asn He 


Met 


Lys 


Lys 


Trp He 


Pro 



He Lys Cys Val Thr Val Gly Asp 

10 15 
Leu He Cys Tyr Thr Ser Asn Lys 
25 * 30 

Val Phe Asp Asn Phe Ser Ala Asn 
45 

Asn Leu Gly Leu Trp Asp Thr Ala 
60 

Arg Pro Leu Ser Tyr Arg Gly Ala 

75 80 
Leu Val Ser Arg Ala Ser Tyr Glu 

90 95 
Glu Leu Gin His Tyr Ala Pro Gly 
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100 105 110 

Val Pro Val Val Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys 

115 120 125 

His Tyr Leu Leu Asp His Pro Gly Met lie Pro Val Thr Thr Ala Gin 

130 135 140 

Gly Glu Glu Leu Arg Lys Gin Val Gly Ala Leu Tyr Tyr lie Glu Cys 
145 150 155 . 160 

Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala lie 

165 170. 175 

Lys Val Val He Gin Pro Pro Thr Lys Gin Arg Glu Lys Lys Lys Lys 

180 185 190 

Lys Gin Arg Arg Gly Cys Ser Met Met Asn Phe Ser Gly Arg Lys Met 

195 200 205 

Leu Cys Phe Lys Ser 
210 

<210> 383 

<211> 762 : 

<212> DNA 

<213> Hordeum vulgare var 

<220> 

<221> CDS 

<222> (33) . . (689) 

<400> 383 

ggatccgctg gagaggagag gagaggagag ac atg age ggc gga gcg ggg gcg 53 

Met Ser Gly Gly Ala Gly Ala 
1 5 

gcg acg gcg gtg age agg ttc ate aag tgc gtg gee gtg ggc gac ggc 101 
Ala Thr Ala Val Ser Arg Phe lie Lys Cys Val Ala Val Gly Asp Gly 

10 15 20 

gec gtg ggc aag ace tgc atg etc ate tgc tac acc tgc aac aag ttc 149 
Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Cys Asn Lys Phe 

25 30 35 . 

ccc acc gac tac ate ccc acc gtg ttc gac aac ttc age gee aat gtc 197 
Pro Thr Asp Tyr He Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val 
40 45 50 55 

tec gtg gac ggg age ate gtc aac etc ggc etc tgg gac acc gca ggt 245 
Ser Val Asp Gly Ser He Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 

60 65 70 

cag gag gat tac age agg ctg agg cct ctg age tac agg gga gee gat 293 
Gin Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp 

75 80 85 

gtc ttc ate etc tec ttc tec etc acc age aga gca age tat gag aat 341 
Val Phe He Leu Ser Phe Ser Leu Thr Ser Arg Ala Ser Tyr Glu Asn 

90 95 100 

gtg cac aag aag tgg atg ccg gag ett 'ege egg tac gee ccc ggc att 389 
Val His Lys Lys Trp Met Pro Glu Leu Arg Arg Tyr Ala Pro Gly He 

105 110 lis 

cct gta ctg ctt gtt gga acc aag ttg gat etc egg gag gat aga get 437 
Pro Val Leu Leu Val Gly Thr Lys Leu Asp Leu Arg Glu Asp Arg Ala 
120 125 130 ~ 135 

tat ctt get gat cat gca get gat tec ate ata aca act gag cag ggt 485 
Tyr Leu Ala Asp His Ala Ala Asp Ser He He Thr Thr Glu Gin Gly 

140 145 150 

gag gat ctt agg aga caa ata ggt get gtg gca tac ata gaa tgc age . 533 

Glu Asp Leu Arg Arg Gin He Gly Ala Val Ala Tyr He Glu Cys Ser 

155 160 165 

tec aag aca caa agg aac att aag get gtt ttc gac acc gca ate aag 581 
Ser Lys Thr Gin Arg Asn He Lys Ala Val Phe Asp Thr Ala He Lys 

170 175 180 

gcg gtt ctt caa cct caa agg: cac aag gag gta gec aga aag gaa act 629 
Ala Val Leu Gin Pro Gin Arg His Lys Glu Val Ala Arg Lys Glu Thr 

185 190 195 

egg aca cgc tct agt egg tea gta agg cag tac ttc tgt ggg agt tct 677 
Arg Thr Arg Ser Ser Arg Ser Val Arg Gin Tyr Phe Cys Gly Ser Ser 
200 205 210 215 
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tgt ttc gcg tagccgagaa gaacattctg gagtctttcc tcgacacgat gctcagatgg 7; 
Cys Phe Ala 

cccgtggttg ttctccaaat ggggta 7i 



<210> 384 

<211> 218 
<212> PRT 

<213> Hordeum vulgare var 
<400> 384 

Met Ser Gly Gly Ala Gly Ala Ala Thr Ala Val Ser Arg Phe He Lys 

15 10 15 

Cys Val Ala Val Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu He . 

~ 20 25 30 

Cys Tyr Thr Cys Asn Lys Phe Pro Thr Asp Tyr He Pro Thr Val Phe ■ 

35 " 40 45 

Asp Asn Phe Ser Ala Asn Val Ser Val Asp Gly Ser He Val Asn Leu 

50 55 60 . 

Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Ser Arg Leu Arg Pro 
65 70 75 80 

Leu Ser Tyr Arg Gly Ala Asp Val Phe He Leu Ser Phe Ser Leu Thr 

85 90 95 

Ser Arg Ala Ser Tyr Glu Asn Val His Lys Lys Trp Met Pro Glu Leu 

100 105 HO 

Arg Arg Tyr Ala Pro Gly He Pro Val Leu Leu Val Gly Thr Lys Leu 

115 ■ • ~ 120 125 

Asp Leu Arg Glu Asp Arg Ala Tyr Leu Ala Asp His Ala Ala Asp Ser 

130 135 140 

He He Thr Thr Glu Gin Gly Glu Asp Leu Arg Arg Gin He Gly Ala 
145 150 155 160 

Val Ala Tyr He Glu Cys Ser Ser Lys Thr Gin Arg Asn He Lys Ala 

165 170 175 

Val Phe Asp Thr Ala He Lys Ala Val Leu Gin Pro Gin Arg His Lys 

180 185 190 

Glu Val Ala Arg Lys Glu Thr Arg Thr Arg Ser Ser Arg Ser Val Arg 

195 ' 200 205 

Gin Tyr Phe Cys Gly Ser Ser Cys Phe Ala 
210 215 

<210> 385 
<211> 721 
<212> DNA 

<213> Hordeum vulgare 

<220> 

<221> CDS 

<222> (38) . . (676) 

<400> 385 

gtggaggcgc ggcgagagcg gcggaggcgg aggagag atg age gtg acc aag ttc 

Met Ser Val Thr Lys Phe 
1 5 
ate aag tgc gtc acg gtg ggg gac ggc gec gtc ggc aag acc tgc atg 
He Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Met 

10 15 20 

etc ate tgc tac acc age aac agg ttc ccc agt gat tac ate ccc acg 
Leu He Cys Tyr Thr Ser Asn Arg Phe Pro Ser Asp Tyr He Pro Thr 

25 " 30 35 

gtg ttc gac aac ttc age gee aac gtc tec gtc gac ggc aac ate gtc 
Val Phe Asp Asn Phe Ser Ala Asn Val Ser Val Asp Gly Asn He Val 

40 45 50 

aac etc ggc eta tgg gac acc gee ggg caa gaa gac tac age egg ctg 
Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Ser Arg Leu 
55 60 65 70 
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agg ccg ctg age tac aga ggc gec gac gtg ttc gtg etc gec ttc tec 295 
Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val Phe Val Leu Ala Phe Ser 

75 80 85 

etc ate age age gec age tac gag aat gtt ctt aag aag tgg atg cca 343 
Leu lie Ser Ser Ala Ser Tyr Glu Asn Val Leu Lys Lys Trp Met Pro 

90 95 100 

gag etc cgc egg ttc gcg ccg aat gtc ccc att gtt ctt gtt ggg acc 391 
Glu Leu Arg Arg Phe Ala Pro Asn Val Pro lie Val Leu Val Gly Thr 

105 no 115 

aag eta gat ctg cgt gac cac aga gec tac etc gec gac cac ccc ggt 439 
Lys Leu Asp Leu Arg Asp His Arg Ala Tyr Leu Ala Asp His Pro Gly 

120 125 130 

get tea gca ate aca act gca cag ggt gaa gaa ctt agg aag cag ate 487 
Ala Ser Ala lie Thr Thr Ala Gin Gly Glu Glu Leu Arg Lys Gin lie 
135 140 145 150 

ggc gee gcg get tac ate gag tgc age tec aag aca cag cag aac gtc 535 
Gly Ala Ala Ala Tyr lie Glu Cys Ser Ser Lys Thr Gin Gin Asn Val 

155 160 165 
aag get gtg ttt gac acc gee ata aag gtg gtc etc cag ccg ccg agg 583 
Lys Ala Val Phe Asp Thr Ala lie Lys Val Val Leu Gin Pro Pro Arg 

170 175 180 

aga agg gag gtg atg tec gee agg aag aaa acc agg cga age tct gga 631 
Arg Arg Glu Val Met Ser Ala Arg Lys Lys Thr Arg Arg Ser Ser Gly 

185 190 • 195 

tgc tec ate aag cac ttg ate tgc ggg agt acg tgc get get tgaattagca 683 
Cys Ser lie Lys His Leu lie Cys Gly Ser Thr Cys Ala Ala 

200 205 210 

ccatggaggc ctggactgac tatggagatg aagcatgg 721 



<210> 386 
<211> 212 
<212> PRT 

<213> Hordeum vulgare 
<400> 386 

Met Ser Val Thr Lys Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

1 5 10 15 

Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser Asn Arg Phe Pro 

20 25 30 

Ser Asp Tyr lie Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val Ser 

35 40 45 

Val Asp Gly Asn lie Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin 

50 55 60 

Glu Asp Tyr Ser Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp Val 
65 70 75 ~ 80 

Phe Val Leu Ala Phe Ser Leu lie Ser Ser Ala Ser Tyr Glu Asn Val 

85 90 95 

Leu Lys Lys Trp Met Pro Glu Leu Arg Arg Phe Ala Pro Asn Val Pro 

100 105 110 

lie Val Leu Val Gly Thr Lys Leu Asp Leu Arg Asp His Arg Ala Tyr 

115 120 ~ 125 

Leu Ala Asp His Pro Gly Ala Ser Ala lie Thr Thr Ala Gin Gly Glu 

130 135 140 

Glu Leu Arg Lys Gin He Gly Ala Ala Ala Tyr lie Glu Cys Ser Ser 
145 150 155 160 

Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Thr Ala He Lys Val 

165 170 175 

Val Leu Gin Pro Pro Arg Arg Arg Glu Val Met Ser Ala Arg Lys Lys 

180 185 190 

Thr Arg Arg Ser Ser Gly Cys Ser He Lys His Leu He Cys Gly* Ser 

195 200 205 

Thr Cys Ala Ala 
210 



<210> 387 
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<211> 699 
<212> DNA 

<213> Hordeum vulgare 

<220> 

<221> CDS 

<222> (67).. (660) 

<400> 387 

ggatcccgat tccatcagga aagcatatag actagcccag taaatagaaa . taagnaaaga 60 



156 



300 



348 



396 



tcggcg atg age gca tct egg ttc ate aag tgc gtg acg gtg ggg gac 108 
Met Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp 
1 5 10 

ggc gec gtg gga aag aca tgc etc etc ate tea tac aca tec aac ace 
Gly Ala Val Gly Lys Thr Cys Leu Leu lie Ser Tyr , Thr Ser Asn . Thr 
15 20 ^ 25 30 

ttc ccc aca gac tat gtc cca aca gtt ttc gac aac ttc age get aac 204 
Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 

35 40 45 

gtc gtg gtt gac ggc age acc gtc aac etc gga tta tgg gat act gca 252 
Val Val Val Asp Gly Ser Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 

50 55 60 

gga caa gaa gac tat aat cga eta cgc cca eta age tac cgt ggt gee 
Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 

65 70 75 

gat gtc ttc ctg etc gee ttt tct etc ate age aaa gca age tac gag 
Asp Val Phe Leu Leu Ala Phe Ser Leu He Ser Lys Ala Ser Tyr Glu 

80 85 90 

aat gtc act aag aag tgg att cca gag tta egg cac tat get cct ggc 
Asn Val Thr Lys Lys Trp He Pro Glu Leu Arg His Tyr Ala Pro Gly 
95 100 105 HO 

gtg ccc ata att ctt gtt gga aca aag ctt gat ctg egg gat gac aag 444 
Val Pro He He Leu Val Gly Thr Lys Leu Asp Leu Arg Asp Asp Lys 

■' 115 120 125 

cag ttt ttt gtg gat cac cct ggg gcg gtt cct att tec act get cag 492 
Gin Phe Phe Val Asp His Pro Gly Ala Val Pro He Ser Thr Ala Gin 

130 135 140 

ggt gaa gag ctg aag aag gtg att ggc gcg act gee tac ate gag tgc 540 
Gly Glu Glu Leu Lys Lys Val He Gly Ala Thr Ala Tyr He Glu Cys 

145 150 155 

age tea aaa aca cag cag aac ate aag gcg gtg ttt gat gcg gcg ate 588 
Ser Ser Lys Thr Gin Gin Asn He Lys Ala Val Phe Asp Ala Ala He 

160 165 170 

aag gtg gtc etc cag cct ccg aag cag aag egg aag aag agg aag tea 636 
Lys Val Val Leu Gin Pro Pro Lys Gin Lys Arg Lys Lys Arg Lys Ser 
175 180 185 190 

cag aaa gga tgc age ate ttg taaagctaaa atcccttttg ttttgcagtg 687 
Gin Lys Gly Cys Ser He Leu 
195 

tctcgcgtcg ac 699 



<210> 388 
<211> 197 
<212> PRT 

<213> Hordeum vulgare 
<400> 388 

Met Ser Ala Ser Arg Phe He Lys 
1 5 
Val Gly Lys Thr Cys Leu Leu He 
20 

Thr Asp Tyr Val Pro Thr Val Phe 
35 40 



Cys Val Thr Val Gly Asp Gly Ala 

10 15 
Ser Tyr Thr Ser Asn Thr Phe Pro 
25 30 
Asp Asn Phe Ser Ala Asn Val Val 
45 



WO 2005/014828 PCT/EP2004/008136 

285/291 



val Asp 






vai 


Asn Leu Gly Leu 


Trp 


Asp 


Thr 


Ala 


Gly Gin 


50 








55 




60 








lyr ASH 


Arg 


lieu 


Arg Pro Leu Ser 


Tyr 


Arg 


Gly 


Ala 


Asp Val 


65 






70 




75 








80 


xrxxc lieu 


Leu Ala. 


Pile 


Ser 


Leu lie Ser Lys 


Ala 


Ser 


Tyr 


Glu 


Asn Val 


rpy» t vjra 
1 IlA Jjy S 








Qn 








95 


ijys i rp 
inn 

1UU 


lie 


Pro 


Glu Leu Arg His 
105 


Tyr 


.Ala 


Pro 


Gly 
110 


Val Pro 


lie lie 


Leu vai 
115 


Gly 


Thr 


Lys Leu Asp Leu 
120 


Arg 


Asp 


Asp 
125 


Lys 


Gin Phe 


irlie VaJ. 


ASp HXS 


Pro 


Gly 


Ala Val Pro He 


Ser 


Thr 


Ala 


Gin 


Gly Glu 










li D 




140 






Glu Leu 


Lys Lys 


Val 


lie 


Gly Ala Thr Ala 


Tyr 


He 


Glu 


Cys 


Ser Ser 


145 






150 




155 






160 


Lys Thr 


Gin Gin 


Asn 
165 


lie 


Lys Ala Val Phe 
170 


Asp 


Ala 


Ala 


He 


Lys Val 
175 


Val Leu 


Gin Pro 


Pro 


Lys 


Gin Lys Arg Lys 


Lys 


Arg 


Lys 


Ser 


Gin Lys 




18"0 






185 








190 


Gly Cys 


Ser lie 
195 


Leu 

















<210> 389 
<211> 677 
- <212> DNA 
<213> Hordeum vulgare 

<220> 

<221> CDS 

<222> (27) . . (668) 

<400> 389 

ggatccttct cgtccattta geegge atg gcg tec age gee tec egg ttc ate 53 

Met Ala Ser Ser Ala Ser Arg Phe lie 
1 5 

aag tgc gtc acc gtc ggg gac ggc gec gtc ggc aag ace tgc atg etc 101 

Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys Thr Cys Met Leu 

10 15 20 25 

ate tgc tac acc age aac aag ttc ccc ace gac tac gtg ccc acc gtg 149 

He Cys Tyr Thr Ser Asn Lys Phe Pro Thr Asp Tyr Val Pro Thr- val 

30 35 40 

ttc gac aat ttc age gcg aac gtg gtg gtg gac ggc acc acc gtg aac 197 
Phe Asp Asn Phe Ser Ala Asn Val Val Val Asp Gly Thr Thr Val Asn 

45 50 55 

ctg ggc ctc.tgg gac act- gca ggg cag gag gac tac aac aga ttg aga 245 
Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg 

60 65 70 

ccg ctg age tac egg gga gee gac gtc ttc gtg etc tec ttc teg etc 293 
Pro Leu Ser Tyr Arg Gly Ala Asp Val Phe Val Leu Ser Phe Ser Leu 

75 80 ,85 

gtc age cga gee age tac gag aat gtc atg aag aag tgg eta ccg gag 341 
Val Ser Arg Ala Ser Tyr Glu Asn Val Met Lys Lys Trp Leu Pro Glu 
90 95 100 " 105 

ctt cag cac cat gca ccc ggc gtg cca aca gtg ctg gtt ggt aca aag 389 
Leu Gin His His Ala Pro Gly Val Pro Thr Val Leu Val Gly Thr Lys 

110 lis 120 

eta gat eta cgt gaa gac aag caa tac tta ctt gac cac ccc ggc gtg ,43 7 

Leu Asp Leu Arg Glu Asp Lys Gin Tyr Leu Leu Asp His Pro Gly Val 

125 130 135 

gtg cct gtt act aca get cag ggg gag gaa etc cgc aag cac ate ggt 485 
Val Pro Val Thr Thr Ala Gin Gly Glu Glu Leu Arg Lys His He Gly 

140 145 ~ 150 

gca act tgt tat gtc gaa tgc age tea aag aca cag cag aat gtc aaa 533 
Ala Thr Cys Tyr Val Glu Cys Ser Ser Lys Thr Gin Gin Asn Val Lys 

155 160 165 

get gtg ttt gat get gec ate aag gta gtg ate aaa cct cca aca aag 581 
Ala Val Phe Asp Ala Ala He Lys Val Val He Lys Pro Pro Thr Lys 
170 175 180 185 

cag agg gaa agg agg aag aag aaa gca egg caa gga tgt gca tea ttg 629 
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Gin Arg Glu Arg Arg Lys Lys Lys Ala Arg Gin Gly Cys Ala Ser Leu 

190 195 200 

ggt acc ctg tea aga agg aag ctg gca tgc ttc aag tgatcagtcg 675 
Gly Thr Leu Ser Arg Arg Lys Leu Ala Cys Phe Lys 

205 210 
ac 677 



<210> 390 
<211> 213 
<212> PRT 

<213> Hordeum vulgare 



<400> 390 





Al a 




oci 


Al a 


Ser Arg 


P>ip ti a Tiva fvfi Val TVit* Val m v Asn 

JtrXXC XXCS XJjrD vjr O V CL JL X JLXX Vfl 1 _ VJX jr 










5 






10 15 




Al a 


v ai 






Thr 


Cys 


Mph T .fan Tip (*!vej Tvt Thi* Re?* &ag Lvs 

1 1C L. J— id LI XXC w jr 0 XjrX X XXX QC1 n9U XJjr •» 








20 








25 30 




JTX VJ 


Thr 


Asp 




Val 


Pro 


Thi* Val Phe Asti A an Phe Ser Ala Asn 






35 










40 45 


val 


Val 


Val 


Asp 


Gly 


Thr 


Thr 


Val Asn Leu Gly Leu Trp Asp Thr Ala 




50 










55 


60 


Gly 


Gin 


Glu 


Asp 


Tyr 


Asn 


Arg 


lieu Arg Pro Leu Ser Tyr Arg Gly Ala 


65 










70 




75 80 


Asp 


Val 


Phe 


Val 


Leu 


Ser 


Phe 


Ser Leu Val Ser Arg Ala Ser Tyr Glu 








85 






90 ^ 95 


Asn 


Val 


Met 


Lys 


Lys 


Trp 


Leu 


Pro Glu Leu Gin His His Ala Pro Gly 








100 






105 110 


Val 


Pro 


Thr 


Val 


Leu 


Val Gly 


Thr Lys Leu Asp Leu Arg Glu Asp Lys 






115 










120 125 


Gin 


Tyr 


Leu 


Leu 


Asp 


His 


Pro 


Gly Val Val Pro Val Thr Thr Ala Gin 




130 










135 


140 


Gly 


Glu 


Glu 


Leu 


Arg 


Lys 


His 


lie Gly Ala Thr Cys Tyr Val Glu Cys 


145 










150 




155 160 


Ser 


Ser 


Lys 


Thr 


Gin 


Gin 


Asn 


Val Lys Ala Val Phe Asp Ala Ala lie 








165 






170 175 


Lys 


Val 


Val 


lie 


Lys 


Pro 


Pro 


Thr Lys Gin Arg Glu Arg Arg Lys Lys 








180 








185 190 


Lys 


Ala 


Arg 


Gin 


Gly 


Cys 


Ala 


Ser Leu Gly Thr Leu Ser Arg Arg Lys 






195 










200 205 


Leu 


Ala 


Cys 


Phe 


Lys 










210 















<210> 391 
<211> 565 
<212> DNA 

<213> Nicotiana tabacum 

<220> 

<221> CDS 

<222> (71) . . (565) 

<400> 391 

ctttttccaa tttcaactcc ataaaactaa gaagctagtg ttcttctttc ttatctttct 60 



aattgatgag atg aat act agt agt agt gec agt aat agt get tct act 109 
Met Asn Thr Ser Ser Ser Ala Ser Asn Ser Ala Ser Thr 
15 10 
gca aca gga aca aag ttc ate aaa tgt gtg aca gtt gga gat ggt get 157 
Ala Thr Gly Thr Lys Phe lie Lys Cys Val Thr Val Gly Asp Gly Ala 

15 20 25 

gtt ggc aag act tgc ctt etc ate tec tac act age tgc ctt etc ate 205 
Val Gly Lys Thr Cys Leu Leu He Ser Tyr Thr Ser Cys Leu Leu He 
30 35 40 45 

tec tac act age aac act ttt cca act gat tat gtg cca act gtt ttt 253 
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Ser Tyr Thr Ser Asn Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe 

50 55 60 

gac aat ttc agt get aat gtc aat gtt gat ggg aag att gtg aat ttg 301 
Asp Asn Phe Ser Ala Asn Val Asn Val Asp Gly Lys lie Val Asn Leu 

65 70 75 

ggt ctt tgg gat act get ggt caa gag gat tat aac agg ctt agg cct 349 
Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro 

80 85 90 

ctt agt tat cga gga get gac gtc ttc ttg ctt gca ttc tct etc ata 397 
Leu Ser Tyr Arg Gly Ala Asp Val Phe Leu Leu Ala Phe Ser Leu lie 

95 100 105 

agt agg cct age ttt gaa aat ata tea aaa aag tgg gtt cct gag eta 445 
Ser Arg Pro Ser Phe Glu Asn He Ser Lys Lys Trp Val Pro Glu Leu 
110 115 120 125 

aga cat tat gee cca tea gtg cct att gtt ctt gtg ggg act aaa ttg 493 
Arg His Tyr Ala Pro Ser Val Pro He Val Leu Val Gly Thr Lys Leu 

130 135 ■ 140 

gat tta aga gag gac aag cag ttt aga agg gac tac cct ggt gca tct 541 
Asp Leu Arg Glu Asp Lys Gin Phe Arg Arg Asp Tyr Pro Gly Ala Ser 

145 150 ~ 155 

aca att tea aca gaa cag ggc tag 565 
Thr He Ser Thr Glu Gin Gly 
160 



<210> 392 
<211> 164 
<212> PRT 

<213> Nicotiana tabacum 
<400> 392 

Met Asn Thr Ser Ser Ser Ala Ser .Asn Ser Ala Ser Thr Ala Thr Gly 

1 5 10 15 

Thr Lys Phe He Lys Cys Val Thr Val Gly Asp Gly Ala Val Gly Lys 

20 25 30 

Thr Cys Leu Leu He Ser Tyr Thr Ser Cys Leu Leu He Ser Tyr Thr 

35 40 45 

Ser Asn Thr Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe 

50 55 60 

Ser Ala Asn Val Asn Val Asp Gly Lys He Val Asn Leu Gly Leu Trp 
65 70 75 80 

Asp Thr Ala Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr 

85 90 95 

Arg Gly Ala Asp Val Phe Leu Leu Ala Phe Ser Leu lie Ser Arg Pro 

100 105 no 

Ser Phe Glu Asn He Ser Lys Lys Trp Val Pro Glu Leu Arg His Tyr 

115 120 125 

Ala Pro Ser Val Pro He Val Leu Val Gly Thr Lys Leu Asp Leu Arg 

130 135 140 

Glu Asp Lys Gin Phe Arg Arg Asp Tyr Pro Gly Ala Ser Thr He Ser 
145 150 155 160 

Thr Glu Gin Gly 



<210> 393 
<211> 1057 
<212> DNA 

<213> Nicotiana tabacum 

<220> 

<221> CDS 

<222> (176) . . (808) 

<400> 393 

ggcacgagaa gctattaatt ccaagctatc tccacaccct ctccccaaac aagaaaatta . 60 



ttaaatgtaa aaegtactae tagaccccca gttacataca taaacacaca teatctgeag 12 0 
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aattttggta tgcaatttta cgtactttct tgaggtattt ttttttttat gttca atg 178 

Met 
1 

gcc tea agt get tea aga ttc ate aaa tgt gtc acg gtt ggt gat ggt 226 
Ala Ser Ser Ala Ser Arg Phe lie Lys Cys Val Thr Val Gly Asp Gly 

5 10 15 

gcc gtt gga aag act tgt atg ctt att tgc tat ace agt aac aag ttc 274 
Ala Val Gly Lys Thr Cys Met Leu He Cys Tyr Thr Ser Asn Lys Phe 

20 25 30 

ccc act gat tat gtt ccc aca gtg ttt gac aac ttc agt gcc aat gtg 322 
Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn Val 

35 " 40 45 

gtt gtc gaa ggg acc aca gta aat tta ggt ctt tgg gat act gca ggc 370 
Val Val Glu Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala Gly 

50 55 60 .65. 

caa gaa gat tat aac aga tta agg cca ctg age tac cga gga gca gat 418 
Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala Asp 

70 75 80 

gtt ttt gtc eta gcg ttc tec ttg gtt agt cgc gca age tac gag aac 466 
Val Phe Val Leu Ala Phe Ser Leu Val Ser Arg Ala Ser Tyr Glu Asn 

85 90 95 

at a ctt aaa aag tgg att cct gaa ctt cag cat tat get cct gga ata 514 
He Leu Lys Lys Trp He Pro Glu Leu Gin His Tyr Ala Pro Gly He 

100 * 105 HO 

ccg gtg gta tta get ggc acc aaa ctt gat ctt cgt gag gat aag cac 562 
Pro Val Val Leu Ala Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys His 

115 120 125 

ttc ttg get gat cat cct gga tta gtt cct gtc acc acc gcg cag gga 610 
Phe Leu Ala Asp His Pro Gly Leu Val Pro Val Thr Thr Ala Gin Gly 
13 0 ^ 135 140 145 

gag gag eta egg aaa caa att ggt get gcc tat tac ate gaa tgt age . 658 
Glu Glu Leu Arg Lys Gin He Gly Ala Ala Tyr Tyr He Glu Cys Ser 

150 155 160 

tct aaa aca caa cag aat gtg aaa get gtc ttt gat get gca ate aag 706 
Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He Lys 

165 170 175 

gtc gtc ate aag cca ccg cag aag caa aag gag aag aag aaa caa cgt 754 
Val Val He Lys Pro Pro Gin Lys Gin Lys Glu Lys Lys Lys Gin Arg 

180 185 190 

cga gga tgt etc atg aat gtg atg tgc gga agg aag etc gtt tgt ttg 802 
Arg Gly Cys Leu Met Asn Val Met Cys Gly Arg Lys Leu Val Cys Leu 

195 200 205 

aag tgacatttca atgtacatat ttcccatctg ttcggcctct aacatgattt 855 
Lys 
210 

atatccatgg etttagegtg ctgcctataa aacttacatt tctatcagaa cttttgtgac 915 
gagcataatg tatttgtttt tccataaccc aacttactgc tggaeggect tgatttgtct 975 
gtaaaatacg ttacttgeta ggtttatatt tcaagtacat gtattaattc tattaaacca 1035 



acaaaaaaaa aaaaaaaaaa aa 



1057 



<210> 394 
<211> 210 
<212> PRT 

<213> Nicotiana tabacum 
<400> 394 

Met Ala Ser Ser Ala Ser Arg Phe He Lys Cys Val Thr Val Gly Asp 
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1 5 10 15 

Gly Ala Val Gly Lys Thr Cys Met Leu lie Cys Tyr Thr Ser Asn Lys 

20 25 * 30 

Phe Pro Thr Asp Tyr Val Pro Thr Val Phe Asp Asn Phe Ser Ala Asn 

35 40 45 

Val Val Val Glu Gly Thr Thr Val Asn Leu Gly Leu Trp Asp Thr Ala 

50 55 60 

Gly Gin Glu Asp Tyr Asn Arg Leu Arg Pro Leu Ser Tyr Arg Gly Ala 
65 70 75 80 

Asp Val Phe Val Leu Ala Phe Ser Leu Val ' Ser Arg Ala Ser Tyr Glu 

85 90 ^ 95 

Asn lie Leu Lys Lys Trp lie Pro Glu Leu Gin His Tyr Ala Pro Gly 

100 105 no 

He Pro Val Val Leu Ala Gly Thr Lys Leu Asp Leu Arg Glu Asp Lys 

115 120 125 

His Phe Leu Ala Asp His Pro Gly Leu Val Pro Val Thr Thr Ala Gin 

130 135 140 

Gly Glu Glu Leu Arg Lys Gin lie Gly Ala Ala Tyr Tyr He Glu Cys 
145 150 155 160 

Ser Ser Lys Thr Gin Gin Asn Val Lys Ala Val Phe Asp Ala Ala He 

165 170 175 

Lys Val Val He Lys Pro Pro Gin Lys Gin Lys Glu Lys Lys Lys Gin 

180 185 190 

Arg Arg Gly Cys Leu Met Asn Val Met Cys Gly Arg Lys Leu Val Cys 
195 200 205 

Leu Lys 
210- 

<210> 395 
<211> 24 
<212> DNA 
<213> Primer 

<400> 395 

atgtctgaaa aggccgttag aagg 24 

<210> 396 
<211> 26 
<212> DNA 
<213> Primer 

<400> 396 

ttataaaatt atgcaacagt tagccc 26 

<210> 397 

<211> 20 

<212> PRT 

<213 > consensus 



<220> 

<221> MIS COFEATURE 
<222> (1) . . (20) 

<223> Xaa in postion 2 is any amino acid, Xaa in postion 4 is Val, Thr, 
He or Leu, Xaa in position 5 is Val, lie or Leu, Xaa in position 
9 is Ala or Gly, Xaa in position 10 is any amino acid, Xaa in pos 
ition 14 is Cys, Ser or Ala, Xaa in position 15 is Leu or Met, Xa 
a in position 16 is Leu or Met, Xaa in position 17 is any amino a 
cid, Xaa in position 18 is any amino acid, Xaa in position 19 is 
Tyr or Phe, Xaa in position 20 is Thr, Ser or Ala. 
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<400> 397 

Lys Xaa Val Xaa Xaa Gly Asp Gly Xaa Xaa Gly Lys Thr Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa 
20 



<210> 398 

<211> 13 

<212> PRT 

<2 13 > consensus 



<220> 

<221> MISC_FEATURE 
<222> (1)..(13) 



<223> xaa in postion 1 is Phe or Tyr, Xaa in position 3 is any amino ac 
id, xaa in position 4 is any amino acid, Xaa in position 6 is. Val, 
lie or Glu, Xaa in position 9 is Val or lie, Xaa in position 11 is 
Asp or Glu, Xaa in position 12 is any amino acid, Xaa in position 
13 is Tyr or Phe. 



<400> 398 

Xaa Pro Xaa Xaa Tyr Xaa Pro Thr Xaa Phe Xaa Xaa Xaa 
i 5 10 



<210> 399 

<211> 22 

<212> PRT 

<213> consensus 



<220> 

<221> MI SC — FEATURE 
<222> (1)..(22) 



<223> xaa in postion 1 is Val, He or Tyr, Xaa in position 2 is any amin 
o acid, xaa in position 3 is Leu, Val or He, Xaa in position 4 is 
any amino acid, Xaa in position 5 is Leu, He or Met, Xaa in posi 
tion 6 is Trp or Phe, Xaa in position 9 is Ala or Ser, Xaa in posi 
tion 12 is Glu or Asp, Xaa in position 13 is Asp or Glu, Xaa in po 
sition 14 is Tyr or Phe, Xaa in position 15 is any amino acid, Xaa 
in position 16 is Arg, Lys or Asn, Xaa in position 17 is Leu, lie 
or Val, Xaa in position 19 is Pro, Ser or Thr, Xaa in position 20 
is Leu or Met, Xaa in position 21 is Ser, Ala or Phe. 
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<400> 399 

Xaa Xaa Xaa Xaa Xaa Xaa Asp Thx Xaa Gly Gin Xaa Xaa Xaa Xaa Xaa 
1 .5 10 15 

Xaa Arg Xaa Xaa Xaa Tyr 
20 



<210> 400 

<211> 143 

<212> PRT 

<213> consensus 

<220> 

<221> MI S C_FEATURE 

<222> (1) . . (143) 

<223> Xaa in all postions is any amino acid. 

<400> 400 

Lys Xaa Val Xaa Val Gly Asp Gly Ala Xaa Gly Lys Thr Cys Leu Leu 

1 5 10 15 



lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe 
20 25 25 

Pro Xaa Xaa Tyr Xaa Pro Thr Val Phe Xaa Asn Xaa Xaa Xaa Xaa Xaa 
30 35 40 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Leu Trp Asp Thr 
45 50 55 

Ala Gly Gin Glu Asp Tyr Xaa Arg Leu Arg Pro Leu Ser Tyr Xaa Xaa 
60 65 70 75 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
80 85 90 

Xaa Xaa Xaa Xaa Asp Val Xaa Xaa Xaa Xaa Phe Ser Xaa Xaa Xaa Xaa 
95 100 105 

Xaa Ser Xaa Xaa Asn Xaa Xaa Xaa Lys Trp Xaa Pro Glu Xaa Xaa Xaa 
110 115 120 

Xaa Xaa Xaa Pro Xaa Xaa Pro Xaa Xaa Leu Val Gly Xaa Lys Xaa Asp 
125 130 135 

Leu Arg Xaa Asp 
140 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property 
Organization 

International Bureau 

(43) Internationa] Publication Date 
17 February 2005 (17.02.2005) 




PCT 



IIIIIH 

(10) International Publication Number 

WO 2005/014828 A3 



(51) International Patent Classification 7 : C12N 15/82, 

C07K 14/395, C12N 5/10, G01N 33/50, C07K 16/14, 
A01H 5/00 



[CA/USJ; 102 Parkarbor Lane, Cary, NC 27519 (US). 
CHEN, Ruoying [CN/US]; 105 Rustic Pine Court, Apex, 
NC 27502 (US). 



(21) International Application Number: 



PCT/EP2004/008136 



(74) Agent: BIEBERBACH, Andreas; c/o BASF Aktienge- 
sellschaft, 67056 Ludwigshafen (DE). 



(22) International Filing Date: 2 1 July 2004 (2 1 .07.2004) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

03016672.2 1 August 2003 (01.08.2003) EP 

PCT/US04/1 1887 15 April 2004 (15.04.2004) US 



(81) Designated States (unless otherwise indicated, for every 
kind of national protection available): AE, AG, AL, AM, 
AT, AU, AZ, BA, BB, BG, BR, BW, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, EG, ES, FI, 
GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, 
KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, 
MG, MK, MN, MW, MX, MZ, NA, NI, NO, NZ, OM, PG, 
PH, PL, PT, RO, RU, SC, SD, SE, SG, SK, SL, S Y, TJ, TM, 
TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, YU, ZA, ZM, 
ZW. 



(71) Applicant (for all designated States except US): BASF 
PLANT SCIENCE GMBH [ — /DE]; Carl-Bosch-Str. 38, 
67056 Ludwigshafen (DE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): PLESCH, Gun- 
nar [DE/DE]; Plantagenstr. 16a, 14482 Potsdam (DE). 
PUZIO, Piotr [DE/DE]; Edeltraudweg 21, 13505 Berlin 
(DE). BLAU, Astrid [DE/DE]; Rotkehlchenweg 33, 
14532 Stahnsdorf (DE). LOOSER, Ralf [DE/DE]; Haupt- 
str. 2, 13158 Berlin (DE). WENDEL, Birgit [DE/DE]; 
Feuerbachstr.53, 12163 Berlin (DE). KAMLAGE, Beate 
[DE/DE]; Hektorstr.19, 10711 Berlin (DE). CHARDON- 
NENS, Agnes [NL/DE]; Edelweissstr. 40, 13158 Berlin 
(DE). SHIRLEY, Amber [US/US]; 2832 Kimmon Way, 
Wake Forest, NC 27587 (US). WANG, Xi-Qing [CN/US]; 
205 Laurens Way, Chapel Hill, NC 27516 (US). SAR- 
RIA-MILLAN, Rodrigo [CO/US]; 2324 Winter Walk 
Circle, Morrisville, NC 27560 (US). MCKERSIE, Bryan 



(84) Designated States (unless otherwise indicated, for every 
kind of regional protection available): ARIPO (BW, GH, 
GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM, 
ZW), Eurasian (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European (AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, 
FR, GB, GR, HU, IE, IT, LU, MC, NL, PL, PT, RO, SE, SI, 
SK, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, 
GW, ML, MR, NE, SN, TD, TG). 

Published: 

— with international search report 

(88) Date of publication of the international search report: 

14 July 2005 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



3 

00 
00 



J£ (54) Title: PROCESS FOR THE PRODUCTION OF FINE CHEMICALS IN PLANTS 

(57) Abstract: The present invention relates to a process for the production of the fine chemical in a microorganism, a plant cell, 
a plant, a plant tissue or in one or more parts thereof. The invention furthermore relates to nucleic acid molecules, polypeptides, 
nucleic acid constructs, vectors, antisense molecules, antibodies, host cells, plant tissue, propagation material, harvested material, 
j^- plants, microorganisms as well as agricultural compositions and their use. 



INTERNATIONAL SEARCH REPORT 



Intetj^femal Application No 

PCT/EP2004/OQ8136 



iOT^WSW" 1 ^^^ C12N5 / 10 GQ1N33/50 C07K16/14 
A01H5/00 

Accotdlng to ImemaBonal Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED — 

Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N C07K 



Documentation searched other than minimum documentation to the extent that such documents are Included in the fields searched 



Electronic database consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal, Sequence Search, WPI Data, PAJ. BIOSIS, EMBASE, CHEM ABS Data 


DOCUMENTS CONSIDERED TO BE RELEVANT 


Category" 


Citation of document with indication, where appropriate, of the relevant passages 


Relevant to claim Mo. 


X 


MADAULE P ET AL: "CHARACTERIZATION OF TWO 
MEMBERS OF THE RHO GENE FAMILY FROM THE 
YEAST SACCHAROMYCES CEREVISIAE" 
PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF USA, NATIONAL ACADEMY OF 
SCIENCE. WASHINGTON, US, 
vol. 84, February 1987 (1987-02), pages 
779-783, XP002038042 
ISSN: 0027-8424 
the whole document 


1-26 


[x] ™ 


her documents are listed In the continuation of box C. J)rf] Patent famDy members are listed in annex. 


• Special categories of cited documents : ^ |atQ{ . document published after the Internationa] filing data 

or priority date and not In conflict with the application but 
"A" document defining the general state of the art which is not c | t0£ j to understand the principle or theory underlying the 

considered to be of particular relevance invention 
"E" earlier document but published on or after the international « X " document of particular relevance; the claimed Invention 

filing date cannot be considered novei or cannot be considered to 
V document which may throw doubts on priority claim(s) or involve an inventive step when the document Is taken alone 

which Is cited to establish the publication date of another try" document of particular relevance; the claimed Invention 

citation or other special reason (as specified) cannot be considered to involve an Inventive step when the 
•O- document referring to an oral disclosure, use, exhibition or document is combined with one ^™<^J^™^ 

other means ments, such combination being obvious to a person skilled 

-P' document published prior to the international filing date but in the ®* 

later than the priority date claimed "& tt document member of the same patent family 


Date of the actual completion of the International search 

28 January 2005 


Date of mailing of the International search report 

2 9. 04. 05 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280HVRijswiik 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nf, 
Fax: (+31-70) 340-3016 


Authorized officer 

Burkhardt, P 



Form PCTTISA/210 (second sheet) (January 2004) 



INTERNATIONAL SEARCH REPORT 



Interracial Application No 

PCT/EP2004/0O8136 



C.(Cont.nuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category 0 


Citation of document, with fncNcatfon, where appropriate, of the relevant passages 


Relevant to dalm No. 



-& DATABASE EMBL [Online] Yeast 

(S.cerevi siae) ras-related RH02 gene 

19 September 1987 (1987-09-19), 

MADAULE ET AL. : "Characterization of two 

members of the rho gene family from the 

yeast Saccharomyces cerevisiae" 

XP0023134 27 

retrieved from EBI 

Database accession no. M15190 

abstract 

WO 01/59128 A (ZAEHRINGER ULRICH ; BASF AG 
(DE); HEINZ ERNST (DE) ; LERCHL JENS (DE);) 
16 August 2001 (2001-08-16) 
the whole document 



1-26 



WO 01/44276 A (BASF PLANT SCIENCE GMBH ; 
LERCHL JENS (DE) ; BADUR RALF (DE); CIRPUS 
PE) 21 June 2001 (2001-06-21) 
the whole document 

WO 01/00804 A (BASF AG) 
4 January 2001 (2001-01-O4) 
the whole document 



WO 03/040293 A (HABERHAUER GREGOR ; 
SCHROEDER HARTWIG (DE) ; BASF AG (DE); 
POMPEJUS MAR) 15 May 2003 (2003-05-15) 
the whole document 



1-26 



1-26 



1-26 



1-26 



Form PCrniSA/210 (continuation ol second sheet) (January 2004) 



INTERNATIONAL SEARCH REPORT 



In^fcational application No. 

^PCT/EP2O04/G08136 



Box II Observations where certain ciaims were found unsearchable (Continuation of Item 2 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 1 7<2)(a) for the f ofiowing reasons: 
1. I I Claims Nos.: 

— because they relate to subject matter not required to be searched by this Authority, namely: 



pn claims Nos.-. 25 (partially) 

— because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
air extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. Claims Nos.: 

— because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box III Observations where unity of invention is lacking (Continuation of item 3 of first sheet) 



This International Searching Authority found multiple inventions In this international application, as follows: 

see additional sheet 

1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
I — ' searchable claims. 

2. r~| As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. I I As only some of the required additional search fees were timefy paid by the applicant, this 
• — ! covers only those claims for which fees were paid, specifically claims Nos.: 



International Search Report 



4. No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the Invention first mentioned in the claims; It is covered by claims Nos.: 

1-26 (all partially) 



Remark on Protest [^] The additional search fees were accompanied by the applicant's protest 

| j No protest accompanied the payment of additional search fees. 



Form PCT/1SA/21 0 (continuation of first sheet (2)) (January 2004) 



Internationa! Application No. PCT/ EP2004/ 008136 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ £10 



Continuation of Box 1 1. 2 
Claims Nos.: 25 (partially) 



Present claim 25 relates to a composition comprising a product defined 
by reference to a desirable characteristic or property, namely to act as 
an agonist or antagonist of the protein as defined by SEQ ID N0:2. 
The application does not provide support within the meaning of Article 6 
PCT nor disclosure within the meaning of Article 5 PCT for such a 
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Independent of the above reasoning, the claim also lacks clarity 
(Article 6 PCT). An attempt is made to define a product within a process 
by reference to a result to be achieved. Again, this lack of clarity in 
the present case is such as to render a meaningful search impossible. 
The search has therefore been limited to those parts of the claim that 
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methods and products comprising said sequences. 
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as invention 1 but relating to the isolated nucleic acid 
sequences with SEQ ID N0s:3, 5, 7, 9, 11, 13, 15, 17, 19, 
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Inventions 194: Claims 1-26 (all partially) 

relating to an amino acid sequence comprising the sequence 
motif as displayed in SEQ ID N0:47, nucleic acid sequence 
encoding said amino acid and methods and products comprising 
said sequences. 
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Inventions 195-203: Claims 1-26 (all partially) 

as invention 1 but relating to amino acid sequences 
comprising the sequence motifs as displayed in SEQ ID 
N0s:48, 49, 50, 51, 52, 397, 398, 399, 400. 



Invention 204: Claims 2-26 (all partially) 

relating to nucleic acid molecules apmplified from a library 
using the primers in SEQ ID NO: 53 and methods and products 
comprising said sequence. 



Invention 205: Claims 2-26 (all partially) 

relating to nucleic acid molecules apmplified from a library 
using the primers in SEQ ID N0:54 and methods and products 
comprising said sequence. 
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